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Description 

Technical Field 

5 [0001] The present invention relates to novel G protein-coupled receptors and genes thereof, and production and 
uses thereof. 

Background Art _ 
10 [0002] G protein-coupled receptor is a generic name for the group of cell membrane receptors transducing signals 
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teristic of seven transmembrane domains in a molecule, and thus called also as "a seven-transmembrane receptor". 
The G protein-coupled receptor transmits the information consisting of various physiologically active substances into 
cells across the cell membrane via the activation of the trimer-type GTP-binding protein and the change of the intrac- 

15 ellular second messengers caused thereby. Well-known intracellular second messengers that are regulated by the 
trimer-type GTP-binding protein include cAMP mediated by adenylate cyclase, and Ca 2+ mediated by phospholipase 
C. Recent studies have shown that many types of intracellular proteins serve as the targets thereof; for example, the 
regulation of channels and activation of phosphorylation enzymes are mediated by the trimer-type GTP-binding protein 
(Annu. Rev. Neurosci. (S7) 20:399). There are a wide variety of substrates (ligands) for the G protein -co up led receptor, 

20 for example, protein hormone; chernokine; peptide; amine; substances derived from lipids; and protease, such as 
thrombin, is also one such example. The number of human G protein-coupled receptors whose genes have been 
identified recently, is a little under 300, excluding the sensory-type receptors. However, the number of G protein-coupled 
receptors to which the ligands have been identified is only about 140 types. Thus, there are 100 or more, ligand- 
unknown, "orphan G protein-coupled receptors". The human genome has been assumed to contain at least 400 types, 

25 and possibly up to 1000 types of G protein-coupled receptors (Trends Pharmacol. Sci. (97) 18:430). This means that 
the number of functionally unknown orphan G protein-coupled receptors can be exploding accompanied by the rapid 
progress of the genome analysis. 

[0003] Ninety % or more drugs that have so far been produced by the pharmaceutical companies in the world aim 
at the interaction in extracellular spaces, and low-molecular-weight drugs comprises the majority of those relating to 

30 G protein-coupled receptors. The reason is that the G protein-coupled receptor- related diseases include many types 
of diseases, such as those of the cerebral nervous system, circulatory system, digestive system, immune system, 
locomotor system, urinary system, and genital system, including genetic diseases. Thus, in recent years, many phar- 
maceutical companies retain their orphan G protein-coupled receptors found through the genome analysis, and are 
competing fiercely with each other to reveal the ligands and physiological functions. Based on this, successful cases 

35 of physiological screening of ligands to some novel G protein-coupled receptors have begun to be reported recently. 
For example, the cases of a calcitonin-related peptide receptor (J. Biol. Chem. (96) 271 :1 1325), orexin (Cell (98) 92: 
573), and prolactin-releasing peptide (Nature. (98) 393:272) gave a great impact to basic studies in the field of life 
science,- . r " 

[0004] In particular, as potential new targets to bring about the drug development, the orphan G protein-coupled 

40 receptors have become a center of attraction. In general, since there are no specific ligands to the orphan G protein- 
coupled receptors, it has been difficult to develop agonists or antagonists . However, in recentyears, creation of orphan 
G protein-coupied receptor-targeted drugs by combining the enriched compound libraries and high-throughput screen- 
ing methods has been proposed (Trends Pharmacol. Sci. (97) 18:430, Br. J. Pharm. (98) 125:1387). Specifically, in 
the creation comprises identifying physiological agonists of an orphan G protein-coupled receptor identified by genetic 

45 engineering, by functional screening utilizing alterations in the level of an intracellular messenger, cAMP or Ca 2+ , as 
an index, and then analyzing the in vivo functions. In this method, high-throughput screening achieved by using a 
compound library allows theoretically to discover surrogate agonists and antagonists specific to the orphan G protein- 
coupled receptor, and further, to develop therapeutic agents for particular diseases. 

so Disclosure of the Invention 

[0005] The present invention was achieved considering the present situation surrounding G protein-coupled recep- 
tors, and an objective thereof is to provide novel G protein-coupled receptors and their genes, and a method for pro- 
ducing and uses of them. Another objective of the present invention is to provide these molecules as targets for the 
55 study of drug development. 

[0006] The present inventors studied strenuously to achieve the above-mentioned objectives, and successfully iso- 
lated nine novel genes comprising nucleotide sequences encoding hydrophobic regions considered to be seven trans- 
membrane domains, which are characteristic of the G protein-coupled receptors, by polymerase chain reaction using 
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cDNAs from human tissues as templates. These genes and the proteins as the translation products can be used in 
the screening of ligand and of agonist or antagonist useful as a pharmaceutical, or can be used for diagnosing diseases 
relating to these genes. 

[0007] Thus, the present invention relates to novel G protein-coupled receptors and the'genes encoding them, and 
5 the uses and production thereof. More specifically, the present invention provides: 

(1) a DNA that encodes a guanosine triphosphate-binding protein-coupled receptor, wherein said DNA is selected 
from the group consisting of the following (a) to (d): 

w (a) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17 

to 21; 

(b) a DNA comprising a coding region of the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26; 

(c) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17 
to 21 in which one or more amino acids are substituted, deleted, added, and/or inserted; and 

15 (d) a DNA hybridizing under stringent conditions to the DNA comprising the nucleotide sequence of any one 

of SEQ ID NOs: 5 to 8 and 22 to 26; 

(2) a DNA encoding a partial peptide of -a protein comprising the amino, acid sequence of any one of SEQ ID NOs: 
1 to 4 and 17 to 21; \ 

20 (3) a vector comprising the DNA of any one of (1 ) and (2) ; 

(4) a transformant carrying the DNA of any one of (1) and (2) or the vector of (3); 

(5) a protein or a peptide encoded by the DNA of any one of (1) and (2); 

(6) a method for producing the protein or the peptide of (5), said method comprising the steps of culturing the 
transformant of (4) and recovering an expressed protein or peptide from the transformant or culture supernatant 

25 thereof; 

(7) a method of screening for ligands that bind to the protein of (5), said method comprising the steps of: 

(a) contacting a test sample with the protein or the peptide of (5); and 

(b) selecting compounds that binds to said protein or said peptide; 

30 

(8) a method of screening for compounds that have activity of inhibiting the binding between the protein of (5) and 
a ligand thereof, said method comprising the steps of: 

(a) contacting the protein of (5) or a partial peptide thereof with the ligand in the presence of a test sample 
35 and detecting a binding activity of said protein or said partial peptide with said ligand; and 

(b) selecting compounds that reduces the binding activity detected in step (a) as compared with a binding 
activity detected in the absence of the test sample; 

(9) a method of screening forcompounds that inhibit or enhance activity of the protein of (5), said method comprising 
40 the steps of: 

(a) contacting a ligand of said protein with cells expressing said protein in the presence of a test sample, 

(b) detecting an alteration in the cells that results from binding of said ligand to said protein, and 

(c) selecting compounds that suppress or enhance the alteration detected in step (b) as compared with an 
45 alteration detected in the cells in the absence of the test sample; 

(1 0) the method of (8) or (9), wherein the alteration in cells is a change in cAMP concentration or calcium concen- 
tration; 

(11) an antibody binding to the protein of (5); 

so (12) a compound isolated by the method of any one of (7) to (10); 

(1 3) a pharmaceutical composition comprising the compound of (1 2) as an active ingredient; 

(14) the pharmaceutical composition of (13), wherein said pharmaceutical composition is formulated for the treat- 
ment of a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease; 

(15) a polynucleotide comprising at least 15 nucleotides, wherein said polynucleotide is complementary to the 
55 DNA comprising the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26 or a complementary 

strand thereof; 

(16) a method for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's 
disease, said method comprising the steps of detecting expression of the DNA of (1) in tissues related to the 



3 



^wsnncirv <f=P 1243648A1 i > 



EP 1 243 648 A1 



disease derived from a subject, or mutation in the DNA of (1) in the subject; and 

(17) a agent for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's 
disease, said agent comprising the antibody of (1 1 ) or the nucleotide of (1 5). 

5 [0008] As used herein, the term "G protein-coupled receptor" means a cell membrane receptor transducing signals 
into cells via the activation of the GTP-binding protein. 

[0009] As used herein, the term "ligand" means a physiological substance binding to the G protein-coupled receptor 
and transducing signals into cells. Herein, the term "physiological substance" means a compound bound to the G 
protein -coup led receptor in vivo. - 
10 [0010] As used herein, the term "agonist" means a compound capable of binding to the G protein-coupled receptor 
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occurring compounds. 

[0011] As used herein, the term "antagonist" means a compound inhibiting the binding' of ligand to the G protein- 
coupled receptor or inhibiting the signal transduction into cells, including biological substances, artificially synthesized 

is compounds, and naturally occurring compounds. 

[0012] The present invention provides novel G protein-coupled receptors and the DNAs encoding the proteins. The 
nine human cDNA clones, isolated by the present inventors and included by the present invention, were named 
"GPRv8", "GPRV12", ,, GPRv16V , GPRv21 ,, ; ,l GPRv40 , \ "GPRv47\ "GPRv51", "GPRv71", and M GPRv72" (as required, 
these clones are collectively referred to as "GPRv"). The nucleotide sequences of, the cDNAs are shown in SEQ ID 

20 NOs: 5 to 8 and 22 to" 26; the amino acid sequences of the proteins encoded by thecDNAs are shown in SEQ ID NOs: 
1 to 4 and 17 to 21. 

[0013] A result obtained by BLAST search showed that amino acid sequence of all the proteins encoded by GPRv 
cDNAs exhibited significant homology to those of known G protein-coupled receptors. Specifically, "GPRvS" exhibited 
36% homology to HUMAN VASOPRESSIN V1 B RECEPTOR (P47901 , 424 aa); "GPRv12" exhibited 27% homology 

25 to RAT 5-HYDROX YTRYPTAM IN E 6 RECEPTOR (P31388, 436 aa); "GPRv16" exhibited 28% homology to MOUSE 
GALANIN RECEPTOR TYPE 1 (P56479, 348 aa); n GPRv21" exhibited 30% homology to BOVIN NEUROPEPTIDE Y 
RECEPTOR TYPE 2 (P79113, 384 aa); M GPRv40 H exhibited 34% homology to OXYTOCIN RECEPTOR (P97926, 388 
aa); "GPRv47" exhibited 43% homology to GPRX_ORYLA PROBABLE G PROTEIN-COUPLED RECEPTOR (Q91 1 78, 
428 aa); "GPRv51 M exhibited 37% homology to PROBABLE G PROTEIN-COUPLED RECEPTOR RTA (P23749, 343 

so aa); "GPRv7r exhibited 45% homology to Chicken P2Y PURINOCEPTOR 3 (P2Y3) (Q98907, 328 aa) ; "GPRv72" 
exhibited 30% homology to ALPHA-1 A ADRENERGIC RECEPTOR (002824, 466 aa). 

[0014] Further, all the proteins encoded by GPRv cDNAs (hereinafter also may be referred to as "GPRv protein") , 
isolated by the present inventors, contained hydrophobic regions, which were assumed to correspond to the seven 
transmembrane domains characteristic of the G protein-coupled receptor. Based on thesef indings, all the GP Rv cDNAs 

35 can be considered to encode proteins belonging to the G protein-coupled receptor family. The G protein-coupled re- 
ceptors have the activity for transducing signals into cells via the activation of the G protein, which is mediated by the 
ligand. As described above, the receptor are involved in many types of diseases, such as those of the cerebral nervous 
system, circulatory system, digestive system, immune system, locomotor system, urinary system, and genital system, 
including genetic diseases. Accordingly, the GPRv proteins can be used to screen for agonists and antagonists regu- 

40 lating the functions of GPRv proteins, and. thus become important targets of drug development for the above diseases. 
[0015] The present invention also provides proteins functionally equivalent to the GPRv proteins. As used herein, 
the term "functionally equivalent" means that a protein of interest has biological properties identical to those of the 
GPRv proteins. The biological properties of GPRv proteins include the activity of transducing signals into cells via the 
activation of the trimer-type GTP-binding protein. According to the types of activated systems of intracellular signal 

45 transduction, the trimer-type GTP-binding proteins are categorized into three classes, namely, Gq type that increases 
the Ca 2+ level, Gs type that increases the cAMP level, and Gi type that reduces the cAMP level (Trends Pharmacol, 
Sci. (99) 20:118). Thus, it can be assessed whether the protein of interest Has biological properties identical to those 
of GPRv proteins, for example, by detecting concentration changes of cAMP or calcium in cells depending on the 
activation. 

so [0016] In an embodiment, the method for preparing a protein functionally identical to the GPRv protein includes a 
method of introducing mutations in the amino acids sequence of the protein. Such method includes, for example, site- 
directed mutagenesis (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons 
Section 8.1-8.5)). The amino acid mutations in the protein can be also occurred naturally. The present invention includes 
mutant proteins, regardless of being generated artificially or naturally, in which one or more amino acids have been 

55 substituted, deleted, inserted and/or added in the amino acid sequences (SEQ ID NOs: 1 to 4 and 17 to 21) of GPRv 
proteins, but the mutant proteins are functionally equivalent to the GPRv proteins. There is no limitation on the number 
of amino acid mutations and positions of the mutations in the proteins, as far as the functions of GPRv proteins are 
retained. The number of mutations is assumed to range typically within 10% of the entire amino acids, preferably within 
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5% of the entire amino acids, further preferably within 1% of the entire amino acids. 

[0017] In another embodiment of the invention, the method for preparing a protein functionally equivalent to the 
GPRv protein includes a method using the hybridization technique or gene amplification technique. Specifically, those 
skilled in the art can typically isolate a DNA having high homology to the DNA sequence encoding the GPRv protein 

5 (SEQ ID NOs: 5 to 8 and 22 to 26) or a partial sequence thereof from a DNA sample derived from a homologous or 
heterologous using the hybridization technique (Current Protocols in Molecular Biology, edit. Ausubel et al. (1987) 
Publish. John Wiley & Sons Section 6.3-6.4)- and then obtain a protein functionally equivalent to the GPRv protein. 
Thus, the protein of the present invention also include a protein encoded by a DNA capable of hybridizing to the DNA 
encoding the GPRv protein, which is functionally equivalent to the GPRv protein. 

w [0018] Organisms to be used fonsolating such a protein include, . for example, rat, mouse, rabbit, chicken, pig, cattle, 
and so forth, in addition to human, but not limited thereto. 

[0019] Typical stringent hybridization conditions for isolating a DNA encoding a protein functionally equivalent to the 
GPRv protein are those of "1x SSC, 0.1% SDS, 37°C" or thejike; more stringently, those of "0.5x SSC, 0.1% SDS, 
42°C n or the like; much more stringently, those of °0.2x SSC, 0.1% SDS, 65°C" orthe like. As the hybridization conditions 

is become more stringent, a DNA with higher homology to the probe sequence can be expected to be isolated. However, 
the above combinations of SSC, SDS, and temperature are only examples, and those skilled in the art can achieve 
the stringencies equivalent to the above by appropriately combining the above or other factors determining the hybrid- 
ization stringency (for example, probe concentration, probe length, time of hybridization reaction, and so forth). 
[0020] The protein encoded by a DNA isolated by using such hybridization, technique typically has high homology 

20 of amino acid sequence to those of the GPRv protein. The term "high homology" means the degree of sequence 
homology of at least 40% or higher, preferably 60% or higher, further preferably 80% or higher (for example, 90% or 
higher, or 95% or higher). 

[0021 ] Identity of amino acid sequence or nucleotide sequence can be determined with the BLAST algorithm of Karlin 
and Altschul (Proc. Natl. Acad. Sci. USA 90:5873-5877, 1 993). Based on this algorithm, the programs, BLASTN and 
25 BLASTX, have been developed (Altschul et al. J. Mol. Biol. 215: 403-410, 1990). When nucleotide sequences are 
analyzed by BLASTN based on BLAST, the parameters are set, for example, as follows: score= 100; and wordlength= 
12. Alternatively, when amino acid sequences are analyzed by BLASTX based on BLAST, the parameters are set, for 
example, as follows: score= 50; and wordlength= 3. When BLAST and the Gapped BLAST program are used for the 
analysis, the default parameters are used in each program. The specific techniques used in these analysis methods 

30 are already known (http://www.ncbi.nlm.nih.gov.). 

[0022] Further, primers are designed based on a part of the DNA sequence (SEQ ID NOs: 5 to 8 and 22 to 26) 
encoding the GPRv protein, a DNA fragment having high homology to the DNA sequence encoding the GPRv protein 
is isolated by the gene amplification technique (PGR) (Current protocols in Molecular Biology, edit. Ausubel et al. (1 987) 
Publish. John Wiley & Sons Section 6.1-6.4), and then the protein functionally equivalent to the GPRv protein can be 
. 35 obtained. - 

[0023] The present invention also includes partial peptides of the protein of the present invention. These partial 
peptides include peptides binding to the ligand but not transducing signals. An affinity column prepared using such a 
peptide can be used suitably for ligand screening. In addition, the partial peptides of the protein of the present invention 
can be used for preparing antibodies. The partial peptides of the present invention can be produced, for example, by 

40 using genetic engineering techniques, known peptide synthetic methods, or methods of digesting the protein of the 
present invention with an appropriate peptidase. The partial peptides of the present invention typically consist of 8 or 
more amino acid residues, preferably 12 or more amino acid residues (for example, 15 or more amino acid residues). 
[0024] The protein of the present invention can be prepared as a recombinant protein or natural protein. The recom- 
binant protein can be prepared, for example, as follows, by introducing a DNA encoding the protein of the present 

45 invention, which has been inserted in a vector, into an appropriate host cell and purifying the protein expressed in the 
transformant. On the other hand, the natural protein can be prepared, for example, by using the affinity column, in 
which an antibody against the protein of the present invention has been immobilized, as follows (Current Protocols in 
Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons Section 16.1 -16.1 9). The antibody to be used 
in the affinity purification may be a polyclonal or monoclonal antibody. Further, the protein of the present invention can 

so be prepared by in wYrotranslation (see, for example, "On the fidelity of mRNA translation in the nuclease-treated rabbit 
reticulocyte lysate system. Dasso, M.C., Jackson, R. J. (1989) NAR 17:3129-3144"), orthe like. 
[0025] The present invention also provides DNAs encoding the above-mentioned proteins of the present invention. 
There is no limitation on the type of DNA of the present invention as far as it can encode the protein of the present 
invention; comprising cDNA, genomic DNA, chemically synthesized DNA, etc. Further, when it encodes the protein of 

55 the present invention, a DNA having any nucleotide sequence based on the degeneration of genetic code is included. 
The DNA of the present invention can be isolated according to a standard method, such as the hybridization method 
using a DNA sequence encoding the GPRv protein (SEQ ID NOs: 5 to 8 and 22 to 26) or a partial sequence thereof 
as a probe or PCR method using primers synthesized based on these DNA sequence, as described above. 
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[0026] In addition, the present invention also provides a vector, in which the DNA of the present invention has been 
inserted. There is no limitation on the type of vector of the present invention, as far as it stably retains the inserted 
DNA. For example, when E. coli is used as a host, the preferable cloning vector is pBIuescript vector (Stratagene) or 
the like. When the vector is used for the purpose of producing the protein of the present invention, an expression vector 

5 is especially useful. There is no limitation on the type of expression vector, as far as it direct the expression of the 
protein in vitro, in E. coli, in culture cells, in the living body, for example, pBEST (Promega) for in vitro expression; pET 
(Invitrogen) for in E. coli, ; pME18S-FL3 (GenBank Accession No. AB009864) for in culture cells; and, pME18S (Mol 
Cell Biol. 8:466-472(1988)) for in the living body of an organism are preferred vector. The insertion of the DNA of the 
present invention into a vectorcan be achieved according to astandard method, for example, by ligation using restriction 

10 enzyme sites {Current protocols in Molecular Biology, edit. Ausubel et al. (1 987) Publish. John Wiley & Sons. Section 

A -4 A -4 A 4 4 \ 

[0027] Also, the present invention provides a transformant containing the DNA of the present invention or the vector 
of the present invention. There is no limitation on the type of host cell into which the vector of the present invention is 
to be introduced, and various types of host cells can be used depending on the purposes. Exemplary eukaryotic cells, 
15 in which the protein is to be expressed at high levels, include COS cell and CHO cell. The vector can be introduced 
into the host cell, by a known method such as, for example, calcium-phosphate precipitation method, electropo ration 
method, (Current protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons. Section 
9.1-9.9), a method with lipofectamine (GIBCO-BRL), microinjection, and so forth. . 

[0028] The present invention also provides nucleotides comprising at least 15 nucleotide residues, which is comple- 
te mentary to the DNA encoding the protein of the present invention (DNA comprising any one of the nucleotide sequences 
of SEQ ID NOs: 5 to 8 and 22 to 26 or the complementary strand thereof). The term "complementary strand" means" 
one strand complementary to the other strand of the two of double-stranded nucleic acid consisting of A:T (U in the 
case of RNA) and G:C nucleotide pairs. Further, the term "complementary" means not only being a perfect comple- 
mentary sequence in a region of at least consecutive 15 nucleotide residues, but also nucleotide sequences with at 
25 least 70% homology, preferably at least 80%, more preferably 90%, further preferably 95% of homology or higher. The 
algorithm described herein can be used for determining homology. These nucleotides can be used as probes for de- 
tecting and isolating the DNA of the present invention, and as primers for amplifying the DNA of the present invention. 
When used as the primer, it typically comprises 1 5 bp -1 00 bp, preferably of 1 5 bp -35bp of nucleotides. Alternatively, 
when used as the probe, it is at least 15 bp of nucleotide containing at least a part of the DNA of the present invention 
30 or the entire sequence. Preferably, such nucleotide specifically hybridize to the DNA encoding the protein of the present 
invention. The term "specifically hybridizing" means that a DNA hybridizes to the DNA encoding the protein of the 
present invention (SEQ ID NOs: 5 to 8 and 22 to 26) but not to DNAs encoding other proteins, undertypical hybridization 
conditions, preferably under stringent conditions. 

[0029] These nucleotides can be used for testing and diagnosing abnormalities of the protein of the present invention. 

35 For example, abnormal expression of the DNA encoding the protein of the present invention can be tested by Northern 
hybridization or RT-PCR using these nucleotides as probes or primers. The nucleotides can be used, for example, in 
the tests for cancers, cirrhosis, or Alzheimer's disease. In addition, the DNA encoding the protein of the present invention 
or the regulatory region for the expression is amplified by polymerase chain reaction (PGR) using the nucleotides as 
primers, and then abnormalities in the DNA sequence can be tested and diagnosed by using the methods such as 

40 RFLP analysis, SSCP, and sequencing. 

[0030] Moreover, the antisense DNA for suppressing expression of the protein of present invention is included in 
these nucleotides. In order to cause the antisense effect, antisense DNA comprises at least 15 bp of nucleotides or 
more, preferably 100 bp, more preferably 500 bp or more, and usually comprises 3000 bp or less, preferably 2000 bp 
or less. Such antisense DNA may be applied to the gene therapy for the disease resulting from the abnormalities 

45 (abnormalities of function or expression) of the protein of present invention and so forth. This antisense DNA can be 
prepared, for example, based on the sequence information of DNA (for example, from SEQ ID NO: 5 to 8 and 22 to 
26) encoding the protein of the present invention, by the phosphorothioate method (Stein, 1 988 Physicochemical prop- 
erties of phosphorothioate oligodeoxynucleotides. Nucleic Acids Res 16. 3209-21 (1988)), etc. 

[0031] For gene therapy : the nucleotide of a present invention can be administered to a patient by ex vivo method, 
50 jn vivo method, and so forth using virus vectors, such as a retrovirus vector, an adenovirus vector, and an adeno 
associated virus vector, and non-virus vectors, such as liposome, etc. 

[0032] Further, the present invention provides the antibody bound with the protein of the present invention. There is 
no limitation in the form of the antibody of the present invention, and a polyclonal antibody and a monoclonal antibody, 
ora partthereof having antigen affinity are also included. Moreover, the antibody of all classes is included. Furthermore, 
55 the antibody of a present invention also include special antibodies, such as a humanized antibody. 

[0033] For a polyclonal antibody, the antibody of the present invention can be obtained by synthesizing oligopeptides 
corresponding to the amino acid sequence of the protein of the present invention according to a standard, and then 
immunized to rabbit (Current protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons. 
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Section 11.12-11.13). For a monoclonal antibody, the hybridoma cell prepared by the cell fusion of the spleen cell and 
myeloma cell of the mouse immunized using the protein expressing in E. coli and then purified according to the standard 
method, and the antibody of the present invention can be obtained from this hybridoma cell (Current protocols in 
Molecular Biology, edit. Ausubel etal. (1987) Publish. John Wiley & Sons. Section 11.4-11.11). 

5 [0034] In addition to purifying of the protein of the present invention, the antibody bound with the protein of the present 
invention may be also used for a test and a diagnosis of the abnormalities in expression or in structure of the protein 
of a present invention. Specifically, protein can be extracted from tissue, blood, or cell, and then can be used for the 
test and the diagnose for presence or absence of the abnormalities of expression or structure, via a detection of the 
protein of the present invention by Western blotting method, immunoprecipitation, ELISA, and so forth. The antibody 

10 of the present invention may be used for a test of cancer, liver cirrhosis, or Alzheimer's disease. 

[0035] Moreover, the antibody bound with the protein of the present invention may be used for the purposes of, such 
as treatment of the disease relevant to the protein of the present invention. The antibody of the present invention can 
effect as the agonist and antagonist for the protein of the present invention. When using an antibody for the purpose 
of treatment of a patient, an antibody derived from human or a humanized antibody is preferable because of little 

15 immunogenicity. An antibody derived from human can be prepared by immunizing the mouse of which immune system 
is replaced with those of human (for example, refer to "Functional transplant of megabase human immunoglobulin loci 
recapitulates human antibody response in mice" Mendez, M.J. et al. (1997) Nat. Genet. 15: 146-156). Moreover, a 
humanized antibody can be prepared by recombination with the hypervariable region of a monoclonal antibody (Meth- 
ods in Enzymology 203, 99-121 (1991)). 

20 [0036] Further, the present invention also provides a screening method for ligands binding to the protein of the present 
invention using the protein of the present invention. This screening method comprises the step of: (a) contacting a test 
sample with the protein of the present invention or a partial peptide thereof; and (b) selecting compounds binding to 
the protein or the partial peptide thereof. 

[0037] Without limiting, the test sample Include, for example, known compounds or peptides (for example, deposited 
25 in the Chemical File) whose activities as the ligands to the various-G protein-coupled receptors have not yet been 
identified or a group of random peptides which have been prepared by phage display method (J. Mot. Biol. (1 991) 222, 
301 -31 0). Further, culture supernatants of microorganisms, natural ingredients from plants or marine organisms, and, 
in addition to these, biological extracts from tissues including brain, cell extracts, expression products of gene libraries, 
but not limited thereto, can be screened. 
30 [0038] The protein of the present invention to be used for the screening can be, for example, the form displayed on 
cell surface, the form as the cell membrane fraction of the cells, or the form immobilized in an affinity column. 
[0039] Specific screening methods include many known methods such as, for example, a method of contacting a 
test sample with an affinity column of the protein of the present invention and purifying compounds bound to the protein 
of the present invention; and Western blotting method. When these methods are used, the test sample is labeled 
35 appropriately and the binding with the protein of the present invention can be detected by using the label. In addition 
to these methods, another method can be used; in which cell membranes expressing the protein of the present invention 
are prepared and immobilized on a chip, and the alterations in surface plasmon resonance, which represent the dis- 
sociation of the trimer-type GTP-binding protein during the ligand binding, are detected {Nature Biotechnology (99) 17: 
1105). 

40 [0040] Further, the binding activity between a test sample and the protein of the present invention can be detected 
for alterations as indices in cells, which is caused by the binding of the test sample to the protein of the present invention 
expressed on cell surface. Such alterations include, for example, alterations of intracellular Ca 2+ level and cAMP levels, 
but not limited thereto. Specifically, the agonist activity to the G protein-coupled receptor can be assayed by GTPyS 
binding method. 

45 [0041] In an example where this method is used, cell membranes on which the G protein-coupled receptor has been 
displayed are mixed with 400 pM 35 S-labeled GTPyS in a solution containing 20 mM HEPES (pH 7.4), 1 00 mM NaCI, 
1 0 mM MgCI 2 , and 50 uJvl GDP, the mixture is incubated either in the presence or in absence of a test sample and then 
filtrated, and the radioactivities of the bound GTP7S are compared. 

[0042] Further, the G protein-coupled receptors share the system of transducing signals into cells via the activation 
so of the trimer-type GTP-binding protein. The trimer-type GTP-binding proteins are categorized into three classes de- 
pending on the types of activated systems of intracellular signal transduction: namely, Gq type that increases the Ca 2+ 
level; Gs type that increases the cAMP level; and Gi type that reduces the cAMP. Thus, the use of a chimeric protein 
consisting of a-subunit.from Gq protein and ot-subunit from another type of G protein, or promiscuous Got proteins, 
Ga15 and Ga16, allows the positive signal in the-ligand screening to result in increased Ca 2+ levels in the pathway of 
55 Gq intracellular signal transduction. The increased Ca 2+ levels can be detected by using, as indices, altered levels of 
a reporter gene having TRE (TPA responsive element) or M RE (multiple responsive element) on upstream, dye indicator 
such as Fura-2 and Fluo-3, and fluorescent protein aequorin. Similarly, the use of a chimeric protein consisting of ot- 
subunit from Gs protein and a-subunit from another type of G protein allows the positive signal to result in increased 
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cAMP levels in the pathway of Gs intracellular signal transduction, and the increased levels can be detected by using, 
as indices, altered levels of a reporter gene having CRE (cAMP-responsive element) on upstream (Trends Pharmacol. 
ScL (99) 20:118). 

[0043] There is no limitation on the type of host cell to be used for the expression of the protein of the present invention 
5 in this screening system, and. various types of host cells can be used depending on the purposes. Such host cells 
include, for example, COS cell, CHO cell, HEK293 cell, etc. The vectors directing the expression of the protein of the 
present invention in vertebrate cells, comprising the promoter upstream of the gene encoding the protein of the present 
invention, RNA splice site, polyadenylation site, and transcription termination sequence and replication origin, and so 
forth can be preferably used. For example, pSV2dhfr (Mol. Cell. Biol. (1981) 1, 854-864), pEF-BOS (Nucleic Acids 
10 Res. (1990) 18, 5322), pCDM8 (Nature (1987) 329, 840-842), and pCEP4 (Invitrogen), containing the SV40 early 
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present invention into a vector can be achieved according to a standard method by ligation using restriction enzyme 
sites (Current protocols in Molecular Biology, edit. Ausubel et al. (1987) Publish. John Wiley & Sons. Section 
11.4-11.11). Further, the vector introduction into a host cell can be achieved by a known method, for example, such as 
15 calcium-phosphate precipitation method, electroporation method (Current protocols in Molecular Biology, edit. Ausubel 
et al. (1987) Publish. John Wiley & Sons. Section 9.1-9.9), a method with lipofectamine (GIBCO-BRL), a method with 
FuGENE6 reagent (Boehringer-Manheim), microinjection method, etc. 

[0044] Once the ligands are isolated by the above screening method for ligands binding to the protein of the present 
invention, screening of compounds inhibiting the interaction between the protein of the present invention and the ligands 

20 can be achieved. Thus, the present invention provides a screening method for compounds having the activity of inhib- 
iting the binding of the protein of the present invention and the ligand thereof. This screening method comprises the 
step of: (a) contacting the ligand with the protein of the present invention or a partial peptide thereof in the presence 
of a test sample, and detecting the binding activity of the protein or a partial peptide thereof with the ligand; and (b) 
selecting compounds reducing the binding activity detected in the step (a) relative to the binding activity in the absence 

25 of the test sample. 

[0045] Without limiting, the test sample include, for example, a group of compounds obtained by combinatorial chem- 
istry technology (Tetrahedron (1 995) 51 , 8135-8137), a group of random peptides prepared by phage display method 
(J. Mol. Biol. (1991) 222, 301 -31 0), and such. Further, culture supernatants of microorganisms and natural ingredients 
from plants or marine organisms, and in addition to these, biological extracts from tissues including brain, cell extracts, 
30 expression products of gene libraries, synthetic low-molecular-weight compounds, synthetic peptides, natural conv 
pounds, and so forth can be screened, but not limited thereto. 

[0046] The protein of the present invention to be used for the screening can be, for example, the form expressed on 
cell surface, the form in the cell membrane fraction of the cells, or the form immobilized in an affinity column. 
[0047] Specific methods that can be used for the screening include, for example, a method in which the ligand is 

35 labeled with a radioisotope or the like, and contacted with the protein of the present invention in the presence of a test 
sample, and then, based on the label linked to the ligand, compounds reducing the binding activity of the protein of 
the present invention to. the ligand are detected as compared to those detected in the absence of the test sample. 
Further, the screening can also be achieved by using the intracellular alterations as an index by the same method as 
used in the above-mentioned screening to isolate ligands capable of binding to the protein of the present invention. 

40 Specifically, the screening for a compound inhibiting the binding of the protein of the present invention with the ligand 
can be carried out by contacting cells expressing the protein of the present invention with the ligands in the presence 
of a test sample, and selecting a compound decreasing the degree of alteration in the cells as compared with those 
detected in the absence of the test sample. The cells expressing the protein of the present invention can be prepared 
by the same method as used in the above-described screening of ligands binding to the protein of the present invention. 

45 The compounds isolated by the screening can be candidates for the agonist or antagonist to the protein of the present. 
[0048] Further, the present invention provides a screening method for compounds inhibiting or enhancing the activity 
of the protein of the present invention. This screening method comprises the step of: (a) contacting the ligand to the 
protein with cells expressing the protein of the present invention in the presence of a test sample; (b) detecting an 
' alteration in the cells due to the binding of the ligand to the protein of the present invention; and (c) selecting compounds 

50 suppressing or enhancing the alteration in the cells detected in the step (b) as compared with the alteration of the cells 
in the absence of the test sample. 

[0049] Such test samples to be used, like those to be used in the above-mentioned screening method for ligands 
binding to the protein of the present invention, include a group of compounds obtained by combinatorial chemistry 
technology, a group of random peptides prepared by using phage display method, culture supernatants of microorgan- 
55 isms, natural ingredients from plants or marine organisms, biological tissue extracts, cell extracts, expression products 
of gene libraries, synthetic low-molecular-weight compounds, synthetic peptides, natural compounds, and such. Fur- 
ther, the compounds isolated by the above-mentioned screening of ligands binding to the protein of the present invention 
can be used as the test samples. The cells expressing the protein of the present invention can be prepared by the 
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same method as the above-described screening of iigands binding to the protein of the present invention. The alteration 
in the cells after contacted with the test sample can be detected by using the alteration of intracellular Ca 2+ level or 
cAMP level as an index, as with the above screening method. Further, the intracellular signal transduction can also be 
detected by using an assay system such as a reporter assay using luciferase as a reporter gene. 

5 [0050] When the result of the detection shows that the alteration in the cells contacted with a test sample is sup- 
pressed as compared to those in the cells contacted with the ligand in the absence of the test sample, the test sample 
used is determined to be a compound inhibiting the activity of the protein of the present invention. Conversely, when 
the test sample enhances the alteration in the cells, the compound is determined to be a compound enhancing the 
activity of the protein of the present invention. The term "enhancing or inhibiting the activity of protein of the present 

10 invention" means that, regardless of a direct or an indirect interaction to the protein of the present invention, the inter- 
action results in the enhancement or inhibition of the activity of protein of the-present invention. Accordingly, the com- 
pounds isolated by the screening include compounds acting on the protein of the present invention or the ligand and 
inhibiting or enhancing the activity of the protein of the present invention by inhibiting or enhancing the binding, as well 
as compounds which do not inhibit nor enhance the binding itself but result in the inhibition or enhancement of the 

15 activity of the protein of the present invention. Such compounds include, for example, compounds which do not inhibit 
nor enhance the binding of the protein of the present invention and the ligand but inhibit or enhance the pathway of 
intracellular signal transduction. 

[0051] When the compounds isolated by the screening method of the present invention are used as pharmaceuticals, 
the isolated compound not only can be directly administered itself to patients but also can be administered as phar- 

20 maceutical compositions which have been formulated by a known pharmaceutical method. For example, the compound 
can be formulated, in a form suitable for oral or parenteral administration, as a pharmaceutical composition obtained 
by combining the compound with pharmaceutical^ acceptable carrier (for example, excipient, binder, disintegrator, 
flavor, cofrigent, emulsifier, diluent, solubilizer, etc.), or preparations, such as tablet, pill, powder, granule, capsule, 
troche, syrup, liquid drug, emulsion, suspension, injection (e.g. liquid drug and suspension), suppository, inhalant, 

25 percutaneous absorbent, eye drop, eye ointment, and so forth,. In general, the administration to patients can be carried 
out by a method known to those skilled in the art, including intraarterial injection, intravenous injection, subcutaneous 
injection, etc. While the doses are different depending on the weight and' age of patient, administration method, and 
such, those skilled in the art can chose proper administration doses if necessary. Further, when the compound is 
• encoded by a DNA, the DNA can be inserted into a vector for gene therapy and thus can be used for gene therapy. 

30 The compound isolated by the screening method of the present invention is expected to be applied to the treatment 
of. for example, cancers, cirrhosis, and Alzheimer's disease. 

[0052] The present invention also provides a disease diagnosing method for cancers, cirrhosis, or Alzheimer's dis- 
ease, comprising the step of detecting the expression of the gene encoding the GPRv protein of the present invention. 
[0053]. In the Example herein, it has been found that the expression levels of the genes encoding the GPRv proteins 
35 of the present invention in affected tissues associated with cancers, cirrhosis, or Alzheimer's disease are significantly 
different as compared to those in normal tissues. Thus, these diseases can be diagnosed by detecting the expression 
of the genes encoding the GPRv proteins of the present invention in tissues of subjects. The term "gene expression" 
means both transcription and translation. 

[0054] The diagnosis method of the present invention can be carried out, for example, as follows. 

40 [0055] The diagnosis can be achieved by extracting RNA from an aliquot of a tissue collected by biopsy or blood 
sample according to a standard method, and quantifying GPRv mRNA by quantitative PCR, Northern hybridization, or 
dot blot hybridization, and such, as describedjn the Example herein. Alternatively, the diagnosis can also be achieved 
by quantifying the GPRv protein in a protein extract from the above tissue by a method such as Western blotting, 
immunoprecipitation, ELISA, and such, or by a noninvasive method where a. labeled compound or antibody binding to 

45 the GPRv protein is administered to patients and detected by PET (positron emission tomography) or the like. 

[0056] When the result of the diagnosis shows that the gene expression in the tissues of a subject exhibits a pattern 
(for example, an increased or decreased gene expression level as compared to that in the normal tissue) identical to 
that of the gene expression in the tissue derived from a patient affected with any one of the above diseases, the subject 
is determined as being affected or as being at a risk of affection with the disease. 

50 [0057] For example, the expression of GPRv8 was detectable in the colon, and the expression level was markedly 
higher in colon cancers. Accordingly; when the expression of GPRy8 is detected at a high level in the colon tissue of 
a subject, the subject is suspected of colon cancer. Alternatively, the expression of GPRvB was undetectable in the 
normal pancreas and uterus, but GPRv8 was expressed at a moderate level after canceration. Accordingly, when the 
expression of GPRv8 can be detected in the pancreas or uterus of a subject, the subject is suspected of pancreatic 

55 cancer or uterine cancer. 

[0058] The expression of GPRv12 was undetectable in the normal ovary and testis, but was detectable after can- 
ceration. Further, the expression level decreased in the hippocampus with Alzheimer's disease. Accordingly, when the 
expression of GPRv12 is detected in the ovary or testis of a subject, the subject is suspected of ovary cancer or 
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testicular cancer. Similarly, when the expression of GPRvl 2 is detected in the hippocampus of a subject at a lower 
level than the normal level, the subject is suspected of Alzheimer's disease. 

[0059] GPRv16 was expressed in the colon, but was undetectable after canceration. The expression level increased 
in the brain after canceration. In the liver, the expression was undetectable after cirrhosis. In the brain of patients with 

5 Alzheimer's disease, the expression level was elevated at the hippocampus. Accordingly, when the expression of 
GPRvl 6 is detected in the colon of a subject at a lower level than the normal level, the subject is suspected of colon 
cancer. Further, when the expression is detected in the brain at a higher level than the normal level, the subject is 
suspected of brain cancer. Further, the expression of GPRvl 6 is detected in the liver at a lower level than the normal 
level, the subject is suspected of cirrhosis. Further, when the expression of GPRvl 6 is detected in the hippocampus 

10 at a higher level than the normal level, the subject is suspected of Alzheimer's disease. 

luOuOj i fie cApicoSiufi ui varnv^ i vvao uiiueibcLidbie in li'icOOiui i diid ie&uS ciiiei Cancer ciuui I. mCuOi uinyiy, wnmi llie 

expression of GPRv21 is detected in the colon or testis of a subject at a lower level than the normal level, the subject 
is suspected of colon cancer or testicular cancer. 

[0061] The expression , level of GPRv40 increased in the brain and testis after canceration, and decreased in the 
15 nver after cirrhosis. Accordingly, when the expression of GPRv40 is detected in the brain or testis at a higher level than 
the normal level, the subject is suspected of brain tumor or testicular cancer. Further, when the expression of GPRv40 
was detected in the liver at a lower level than the normal level, the subject is suspected of cirrhosis, 
[0062] The expression level of GPRv47 increased in the brain and kidney and decreased in the testis, after cancer- 
ation. The expression was undetectable in the iiver after cirrhosis. Accordingly, when the expression of GPRv47 is 
20 detected in the brain or kidney at a higher level than the normal level, the subject is suspected of brain tumor or kidney 
cancer. Further, when the expression of GPRv47 is detected in the iiver at a lower level than the normal level, the 
subject is suspected of cirrhosis. 

[0063] The expression level of GPRv51 decreased in the colon and testis after canceration. The expression level 
also decreased in the liver after cirrhosis as compared to the normal liver. The expression level increased in the hip- 

25 pocampus with Alzheimer's disease. Accordingly, when the expression of GPRv51 is detected in the colon and testis 
at a lower level than the normal level, the subject is suspected of colon cancer or testicular cancer. Further, when the 
expression of GPRv51 is detected in the liver at a lower level than the normal level, the subject is suspected of cirrhosis. 
Further, when the expression of GPRv51 is detected in the hippocampus at a higher level than the normal level, the 
subject is suspected of Alzheimer's disease. 

30 [0064] The expression level of GPRv71 decreased in the colon and kidney, and was undetectable in the liver, after 
cirrhosis. In Alzheimer's disease, the expression level decreased in the frontal lobe. Accordingly, when the expression 
of GPRv71 is detected in the colon or kidney at a lower level than the normal level, the subject is suspected of colon 
cancer or kidney cancer. Further, when the expression of GPRv71 is detected in the liver at a lower level than the 
normal level, the subject is suspected of cirrhosis. Further, when the expression of GPRv71 is detected in the frontal 

35 lobe at a lower level than the normal level, the subject is suspected of Alzheimer's disease. 

[0065] GPRv72 was expressed strongly in the colon, but the expression was undetectable after canceration. The 
expression level of GPRv72 increased in the hippocampus with Alzheimer's disease. Accordingly, when the expression 
of GPRv72 is detected in the colon at a lower level than the normal level, the subject is suspected of colon cancer. 
Further, when the expression of GPRv72 is detected in the hippocampus at a higher level than the normal level, the 

40 subject is suspected of Alzheimer's disease. 

[0066] Furthermore, mutations in the genes encoding GPRv proteins of the present invention may result in the onset 
of the above-mentioned diseases. Thus, the diagnosis for the above-mentioned diseases can be carried out by de- 
tecting such mutations in the genes encoding GPRv proteins of the present invention. 
[0067] Such gene diagnosis can be carried out, for example, as follows. 

45 [0068] As a nucleic acid to be used for the diagnosis, genomic DNA or cDNA may be amplified directly or by PCR 
or other amplification technique. Deletions and insertions can be detected based on size differences of the amplification 
products as compared with that of the normal gene. Point mutations can be identified based on the differences in the 
melting temperature of the amplified DNA hybridized with DNA encoding GPRv. Differences between DNA sequences 
can be found by detecting alterations in the electrophoretic mobility of DNA fragment in a denaturant-containing or 

50 den atu rant-free gel or by direct sequencing of nucleotide sequence of DNA. 

[0069] When the diagnosis result shows that the gene encoding the GPRv protein from a subject has mutations as 
compared with the wild-type sequence, the subject diagnosed to be suspected of the above disease. 
[0070] Namely, a method for diagnosing cancers, cirrhosis, or Alzheimer's disease or a method for diagnosing the 
susceptibility to the diseases are provided by detecting, according to the method described herein, mutations in the 

55 genes encoding the GPRv proteins or increase or decrease in the expression levels of the mRNAs or proteins. 
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Brief Description of the Drawings 
[0071] 

5^ Figure 1 shows a result of BLAST SEARCH with the "GPRv8" amino acid sequence as the query against the entire 

sequence data in SWISS-PROT. The sequence showed 36% homology to HUMAN VASOPRESSIN V1B RECEP- 
TOR. 

Figure 2 shows a result of. BLAST SEARCH with the "GPRv12 M amino acid sequence as the query against the 
entire sequence data in SWISS-PROT The sequence showed 27% homology to RAT 5-HYDROXYTRYPTAMINE 
10 . 6 RECEPTOR. 

Figure 3 shows a result of BLAST SEARCH with the H GPRv16" amino acid sequence as the query against the 
entire sequence data in SWISS-PROT The sequence showed 28% homology to MOUSE G ALAN IN RECEPTOR 
TYPE 1 . 

Figure 4 shows a result of BLAST SEARCH with the "GPRv21" amino acid sequence as the query against the 
is entire sequence data in SWISS-PROT The sequence showed 30% homology to BOVIN NEUROPEPTIDE Y RE- 

CEPTOR TYPE 2. 

Figure 5 shows a result of BLAST SEARCH with the "GPRv40" amino acid sequence as the query against the 
entire sequence data in SWISS-PROT. The sequence showed 34% homology to OXYTOCIN RECEPTOR 
(P97926). 

20 Figure 6 shows a result of BLAST SEARCH with the "GPRv47" amino acid sequence as the query against the 

entire sequence data in SWISS-PROT The sequence showed 43% homology to GPRX_ORYLA PROBABLE G 
PROTEIN-COUPLED RECEPTOR (Q91 1 78). 
. Figure 7 shows a result of BLAST SEARCH with the "GPRv51" amino acid sequence as the query against the 
entire sequence data in SWISS-PROT. The sequence showed 37% homology to PROBABLE G PROTEIN-COU- 

25 PLED RECEPTOR RTA (P23749). 

Figure 8 shows a result of BLAST SEARCH with the "GPRv71" amino acid sequence as the query against the 
entire sequence data in SWISS-PROT The sequence showed 45% homology to P2Y PURINOCEPTOR 3 (P2Y3) 
(Q98907). 

Figure 9 shows a result of BLAST SEARCH with the "GPRv72" amino acid sequence as the query against the 
30 entire sequence data in SWISS-PROT The sequence showed 30% homology to ALPHA-1 A ADRENERGIC RE- 

CEPTOR (002824). 

Figure 10 shows a hydropathy plot for GPRv8. 

Figure 1 1 shows an alignment of GPRv8 and similar families. The mark '*' means that the amino acid is completely 
conserved in alt the sequences at the position marked therewith. The mark ':' means that amino acids at the position 

35 marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, {NDBQ}, {QHRK}, 

{MILV}, {MILF}, {HY}, and {FYW}. The mark 7 means that amino acids at the position marked therewith are con- 
served within any one of the following groups: {CSA}, {ATV), {SAG}, {STNK}, {STPA}, {SGND}, {SNDEQK}, {ND- 
EQHK}, and {NEQHRK}. 
Figure 12 is continued from Figure 11 . 

40 Figure 1 3 shows a hydropathy plot for GPRvl 2. 

Figure 1 4 shows an amino acid sequence alignment of GPRvl 2 and AF208288. The mark means that the amino 
acid is completely conserved in all the sequences at the position marked therewith. The mark ':' means that amino 
acids at the position marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, 
{NHQK}, {NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position 

45 marked therewith are conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, (STNK), {STPA}, 

{SGND}, {SNDEQK}, {NDEQHK}, and {NEQHRK}. 
Figure 15 shows a hydropathy plot for GPRvl 6. 

Figure 16 shows a summary of HMMPFAM, transmembrane domain, and S-S bond of GPRvl 6' The mark 
indicates a region assigned as 7tm_1 based on the result of HMMPFAM. The mark "###" represents transmem- ' 
so brane domain. The mark "@" indicates Cys capable of forming S-S bond. 

Figure 1 7 shows a hydropathy plot for GPRv21 . 

Figure 1 8 shows an amino acid sequence alignment of GPRv21 and the related proteins. The mark '*' means that 
the amino acid is completely conserved in all the sequences at the position marked therewith. The mark ':' means 
that amino acids at the position marked therewith are conserved within any one of the following groups: {STA}, 
55 {NEQK}, {NHQK}, {NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the 

position marked therewith are conserved within any one.of the following groups: (CSA), {ATV}, (SAG), {STNK}, 
{STPA}, (SGND), {SNDEQK), {NDEQHK), and {NEQHRK). 
Figure 1 9 is continued from Figure 18. 
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Figure 20 shows a hydropathy plot for GPRv40. 

Figure 21 shows a summary of HMMPFAM, transmembrane domain, and S-S bond of GPRv40. The mark 
indicates a region assigned as 7tm_1 based on the result of HMMPFAM. The mark"###" indicates transmembrane 
domain. The mark "@" indicates Cys capable of forming S-S bond. 

5 Figure 22 shows a hydropathy plot for GPRv47. 

Figure 23 shows an alignment of GPRv47 and the related proteins. The mark '*' means that the amino acid is 
completely conserved in all the sequences at the position marked therewith. The mark ':' means that amino acids 
at the position marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, 
{NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark '.' means that amino acids at the position marked 

10 therewith are conserved within any one of the following groups: {CSA}, {ATV], {SAG}, {STNK), {STPA}, {SGND}, 

tSNutuK}, (iMUturiK}, ana tiNtuhrifsj. 
Figure 24 is continued from Figure 23. 
Figure 25 is continued from Figure 24. 
Figure 26 shows a hydropathy plot for GPRv51 . 

15 Figure 27 shows an alignment of GPRv51 and the related proteins. The mark means that the amino acid is 

completely conserved in all the sequences at the position marked therewith. The mark ':' means that amino acid 
at the position marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, 
{NDBQ}, {QHRK}, {MILV}, {MILF}, {HY}, and {FYW}. The mark 7 means that amino acid at the position marked 
therewith are conserved within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND}, 

20 {SNDEQK}, {NDEQHK}, and {NEQHRK}. 

Figure 28 shows a hydropathy plot for GPRv71 . 

Figure 29 shows an alignment of GPRv71 and related proteins. The mark'*' means that the amino acid is completely 
conserved in all the sequences at the position marked therewith. The mark 7 means that amino acid at the position 
marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, {NDBQ}, {QHRK}, 

25 {MILV}, {MILF}, {HY}, and {FYW}. The mark 7 means that amino acid at the position marked therewith are con- 

served within any one of the following groups : {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND}, {SNDEQK}, {ND- 
EQHK}, and {NEQHRK}. 
Figure 30 is continued from Figure 29. 
Figure 31 shows a hydropathy plot for GPRv72. 
. 30 Figure 32 shows an alignment of GPRv72 and related proteins. The mark '*' means that the amino acid is comp letely 

conserved in all the sequences at the position marked therewith. The mark 7 means that amino acid at the position 
marked therewith are conserved within any one of the following groups: {STA}, {NEQK}, {NHQK}, {NDBQ}, {QHRK}, 
{MILV}, {MILF}, {HY}, and {FYW}. The mark 7 means that amino acid at the position marked therewith are con- 
served within any one of the following groups: {CSA}, {ATV}, {SAG}, {STNK}, {STPA}, {SGND}, {SNDEQK}, {ND- 

35 EQHK}, and {NEQHRK}. 

Figure 33 is continued from Figure 32. 
Figure 34 is continued from Figure 33. 

Best Mode for Carrying out the Invention N 

40 

[0072] The present invention is specifically illustrated below with reference to Examples, but it is not to be construed 
as being limited thereto. Unless otherwise stated, they can be carried out by known methods (Maniatis, T. et al. (1 982): 
"Molecular Cloning - A Laboratory Manual", Cold Spring Harbor Laboratory, NY). 

45 [Example 1] Isolation of the genes encoding the novel G protein-coupled receptors 

[0073] The full-length cDNAs encoding the novel G protein-coupled receptors of the present invention (GPRv8, 

GPRv12, GPRv16, GPRv21 , GPRv40, GPRv47, GPRv51, GPRv71, and GPRv72) were obtained by PCR. 

[0074] The amplification of the novel G protein-coupled receptor GPRv8 was carried out using a Marathon Ready 

50 cDNA (Clontech) derived from human fetus as a template, and forward primer: 5'-ATGCCAGCCAACTTCACAGAG- 
GGCAGCT-3' (SEQ ID NO: 9) and reverse primer: S'-CTAGATGAATTCTGGCTTGGACAGAATC-S 1 (SEQ ID NO: 10). 
PCR was carried out with Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 94°C 
(2.5 minutes) and 25 cycles of 94°C (30 seconds)/60°C (30 seconds)/72°C (1 minute). The amplification resulted in 
about 1 .1 -kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence 

55 of the resultant clone was determined by dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosys- 
tems). The determined sequence is shown in SEQ ID NO: 5. 

[0075] The sequence comprises an open reading frame of 1116 nucleotides (from the first nucleotide to the 1116th 
nucleotide in SEQ ID NO: 5). An amino acid sequence deduced from the open reading frame (371 amino acids) is 
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shown in SEQ ID NO: 1. Since the deduced amino acid sequence contains hydrophobic regions corresponding to 
seven transmembrane domains characteristic of G protein-coupled receptor, the gene is found to encode a G protein- 
coupled receptor 

[0076] The amplification of the novel G protein-coupled receptor GPRv 12 was carried out using a Marathon Ready 
5 cDNA (Clontech) derived from human fetal brain as atemplate, and forward primer: 5'-ATGGGCCCCGGCGAGGCGCT- 
GCTGGCGG-3? (SEQ ID NO: 11) and reverse primer: 5'-TCAGTGTGTCTGCTGCAGGCAGGAATCA-3' (SEQ ID NO: 
12). PCR was carried out with Pyrobest DNA polymerase (Takara) under the presence of 5% formamide; the thermal 
cycling profile consisted of preheat at 94°C (2.5 minutes), 5 cycles of 94°C (5 seconds)/72°C (4 minutes) , 5 cycles of 
94°C (5 seconds)/70°C (4 minutes) , and 25 cycles of 94°C (5 seconds)/68°C (4 minutes). The amplification resulted 
10 in about 1.1 -kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide se- 
quence of the resultant clone was determined by dideoxy terminator method in an ABI377 DNA Sequencer (Applied 
Biosystems). The determined sequence is shown in SEQ ID NO: 6. 

[0077] The sequence comprises an open reading frame of 1092 nucleotides (from the first nucleotide to the 1092th 
nucleotide in. SEQ ID NO: 6). An amino acid sequence deduced from the open reading frame (363 amino acids) is 
is shown in SEQ ID NO: 2. Since the deduced amino acid sequence contains hydrophobic regions corresponding to 
seven transmembrane domains characteristic of G protein-coupled receptor, the gene is found to encode a G protein- 
coupled receptor. 

[0078] The amplification of the novel G protein-coupled receptor GPRvl 6 was carried out using a Marathon Ready 
cDNA (Clontech) derived from human brain as a template, and forward primer: 5'-ATGCTGGCAGCTGCC l l iGCA- 

20 GACTCTAAC-3* (SEQ ID NO: 13) and reverse primer: 5 , -CTATTTAACACCTTCCCCTGTCTCTTGATC-3 , (SEQ ID NO: 
14). PCR was carried out with Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 
94°C (2 minutes) and 30 cycles of 94°C (30 seconds)/60°C (30 seconds)/72°C (1 minute). The amplification resulted 
in about 1 .2-kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide se- 
quence of the resultant clone was determined by dideoxy terminator method in an ABI377 DNA Sequencer (Applied 

25 Biosystems) . The determined sequence is shown in SEQ ID NO: 7. 

[0079] The sequence comprises an open reading frame of 1260 nucleotides (from the first nucleotide to the 1260th 
nucleotide in SEQ ID NO: 7). An amino acid sequence deduced from the open reading frame (419 amino acids) is 
shown in SEQ ID NO: 3. Since the deduced amino acid sequence contains hydrophobic regions corresponding to 
seven transmembrane domains characteristic of G protein-coupled receptor, the gene is found to encode a G protein- 

30 coupled receptor. 

[0080] The amplification of the novel G protein-coupled receptor GPRv21 was carried out using a Marathon Ready 
cDNA (Clontech) derived from human fetus as a template, and forward primer: S'-ATGGAGACCACCATGGGGTTCAT- 
GGATG-3' (SEQ ID NO: 15) and reverse primer: S'-TTATTTTAGTCTGATGCAGTCCACCTCTTC-S' (SEQ ID NO: 16). 
PCR was carried out with Pyrobest DNA polymerase (Takara) under the presence of 5% formamide; the thermal cycling 

35 profile consisted of preheat at 94°C (2.5 minutes), 5 cycles of 94°C (5 seconds)/72°C (4 minutes), 5 cycles of 94°C (5 
seconds)/70°C (4 minutes), and 25 cycles of 94°C (5 seconds)/68°C (4 minutes). The amplification. resulted in about 
1 .2-kbp DNA fragments. The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the 
resultant clone was determined by dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). 
The determined sequence is shown in. SEQ ID NO: 8. 

40 [0081 ] The sequence comprises an open reading frame of 1 1 82 nucleotides. An amino acid sequence deduced from 
the open reading frame (333 amino acids) is shown in SEQ ID NO: 4. Since the deduced amino acid sequence contains 
hydrophobic regions corresponding to seven transmembrane domains characteristic of G protein-coupled receptor, 
the gene is found to encode a G protein-coupled receptor. 

[0082] The amplification. of the novel G protein-coupled receptor GPRv40 was carried out using a Marathon Ready 
45 cDNA (Clontech) derived from human fetus as a template, and forward primer: 5'-ATGGAGGATCTCTTTAGCCCCT- 

CAATTC-3' (SEQ ID NO: 27) and reverse primer: 5 , -CTAGAAGGCACTTTCGCAGGAGCAAGGC-3 , (SEQ ID NO: 28). 

PCR was carried out with Pyrobest DNA polymerase (Takara) under the presence of 5% formamide; the thermal cycling 

profile consisted of preheat at 98°C (2.5 minutes), 5 cycles of 98°C (5 seconds)/72°C (4 minutes), 5 cycles of 98°C (5 

seconds)/70 6 C (4 minutes), and 25 cycles of 98°C (5 seconds)/68 D C (4 minutes). The amplification resulted in about 
so 1 .3-kbp DNA fragments, The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the 

resultant clone was determined by dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). 

The determined sequence is shown in SEQ ID NO: 22. 

[0083] The sequence comprises an open reading frame of 1305 nucleotides (SEQ ID NO: 22). An amino acid se- 
quence deduced from the open reading frame (434 amino acids) is shown in S EQ ID NO: 1 7. Since the deduced amino 
55 acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of G 
protein-coupled receptor, the gene is found to encode a G protein-coupled receptor. 

[0084] The amplification of the novel G protein-coupled receptor GPRv47 was carried out using a Marathon Ready 
cDNA (Clontech) derived from human fetal brain as a template, and forward primer: 5'-ATGGAGTCCTCACCCATC- 
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CCCCAGTCATC-3' (SEQ ID NO: 29) and reverse primer: S'-TCATGACTCCAGCCGGGGTGAGGCGGCAG-S' (SEQ 
ID NO: 30). PGR was carried out with Pyrobest DNA polymerase (Takara) under the presence of 5% formamide; the 
thermal cycling profile consisted of preheat at 94°C (2 minutes) and 35 cycles of 94°C (30 seconds)/50°C (30 seconds) 
/72°C (1.5 minutes). The amplification resulted in about 1.4-kbp DNA fragments. The fragments were cloned into 

5 pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was determined by dideoxy terminator 
method In an ABI377 DNA Sequencer (Applied Biosystems). The determined sequence is shown in SEQ ID NO: 23. 
[0085] The sequence comprises an open reading frame of 1356 nucleotides (SEQ ID NO: 23). An amino acid se- 
quence deduced from the open reading frame (451 amino acids) is shown in SEQ ID NO: 18. Since the deduced amino 
acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of G 

10 protein-coupled receptor, the gene" is found to encode a G protein-coupled receptor. 

[uutibj The amplification of the novei G prulein-uOupiwu i t?c£|jiOi GPRvol wa© canied uui u&iny a ividimiiun Ready 
cDNA (Clontech) derived from h umah testis as a template, and forward primer: 5 , -ATGAACCAGACTTTGAATAGCAGT- 
GG-3' (SEQ ID NO: 31) and reverse primer: S'-TCAAGCCCCCATCTGATTGGTGCCCACG-S' (SEQ ID NO: 32). PCR 
was carried out with Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 98°C (2.5 

75 minutes) and 35 cycles of 98°C (30 seconds )/50°C (30 seconds)/68°C (4 minutes) . The amplification resulted in about 
1 .0-kbp DNA fragments. The fragments werecloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the 
resultant clone was determined by dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems) . 
The determined sequence is shown in SEQ ID NO: 24. 

[0087] The sequence comprises an open reading frame of 966 nucleotides (SEQ iD NO: 24). An amino acid sequence 
20 deduced from the open reading frame (321 amino acids) is shown in SEQ ID NO: 19. Since the deduced amino acid 
sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of G protein- 
coupled receptor, the gene is found to encode a G protein-coupled receptor. 

[0088] The amplification of the novel G protein-coupled receptor GPRv71 was carried out using a Marathon Ready 
cDNA (Clontech) derived from human fetus as a template, and forward primer: 5'-ATGGAGAAGGTGGACATGAATA- 

25 CATCAC-3' (SEQ ID NO: 33) and reverse primer: 5'-TTACCCAGATCTGTTCAACCCTGGGCATC-3' (SEQ ID NO: 34). 
PCR was carried out with Pyrobest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 94°C 
(2.5 minutes) , 5 cycles of 98°C (5 seconds)/72°C (4 minutes), 5 cycles of 98°C (5 seconds )/70°C (4 minutes), and 25 
cycles of 98°C (5 seconds)/68°C (4 minutes). The amplification resulted in about 1. 0-kbp DNA fragments. The frag- 
ments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was determined 

30 by dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). The determined sequence is shown 
in SEQ ID NO: 25. 

[0089] The'sequence comprises an open reading frame of 1002 nucleotides (SEQ ID NO: 25). An amino acid se- 
quence deduced from the open reading frame (333 amino acids) is shown in SEQ ID NO: 20. Since the deduced amino 
acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of G 

35 protein-coupled receptor, the gene is found to encode a G protein-coupled receptor. 

[0090] The amplification of the novel G protein-coupled receptor GPRv72 was carried out using human genome DNA 
(Clontech) as a template, and forward primer: 5'-ATGACGTCCACCTGCACCAACAGCACGC-3' (SEQ ID NO: 35) and 
reverse primer: 5'-TCAAGGAAAAGTAGCAGAATCGTAGGAAG-3' (SEQ ID NO: 36). PCR was carried out with Py- 
robest DNA polymerase (Takara); the thermal cycling profile consisted of preheat at 94°C (2 minutes) and 30 cycles 

40 of 94°C (30seconds)/55°C(30seconds)/68°C (4 minutes) .The amplification resulted in about 1.5-kbp DNA fragments. 
The fragments were cloned into pCR2.1 plasmid (Invitrogen). The nucleotide sequence of the resultant clone was 
determined by dideoxy terminator method in an ABI377 DNA Sequencer (Applied Biosystems). The determined se- 
quence is shown in SEQ ID NO: 26. 

[0091] The sequence comprises an open reading frame of 1527 nucleotides (SEQ ID NO: 26). An amino acid se- 
45 quence deduced from the open reading frame (508 amino acids) is shown in SEQ ID NO: 21. Since the deduced amino 
acid sequence contains hydrophobic regions corresponding to seven transmembrane domains characteristic of G 
protein-coupled receptor, the gene is found to encode a G protein-coupled receptor. 

[Example 2] BLAST SEARCH of the amino acid sequences of the novel G protein-coupled receptors against 
so SWISS-PROT 

[0092] The result of BLAST SEARCH of the amino acid sequence of "GPRvS" against SWISS-PROT is shown in 
Figure 1. "GPRv8" exhibited the highest homology (36%) to HUMAN VASOPRESSIN V1B RECEPTOR (P47901, 424 
aa) of known G protein-coupled receptors. Thus, "GPRv8" was concluded to be a novel G protein-coupled receptor. 
55 [0093] The result of BLAST SEARCH of the amino acid sequence of "GPRv12" against SWISS-PROT is shown in 
Figure 2. "GPRvl 2" exhibited the highest homology (27%) to RAT 5-HYDROXYTRYPTAMINE 6 RECEPTOR (P31388, 
436 aa) of known G protein-coupled receptors. Thus, GPRvl 2 was concluded to be a novel G protein-coupled receptor. 
[0094] The result of BLAST SEARCH of the amino acid sequence of "GPRvl 6" against SWISS-PROT is shown in 
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Figure 3. "GPRvie" exhibited the highest homology (28%) to MOUSE GALANIN RECEPTOR TYPE 1 (P56479, 348 
aa) of known G protein-coupled receptors. Thus, "GPRvi6" was concluded to be a novel G protein-coupled receptor. 
[0095] The result of BLAST SEARCH of the amino acid sequence of "GPRv21 M against SWISS-PROT is shown in 
Figured "GPRv21" exhibited the highest homology (30%) to BOVIN NEUROPEPTIDE Y RECEPTORTYPE2(P79t 13, 
5 384 aa) of known G protein-coupled receptors. Thus, ,, GPRv21" was concluded to be a novel G protein-coupled re- 
ceptor. 

[0096] The result of BLAST SEARCH of the amino acid sequence of "GPRv40" against SWISS-PROT is shown in 
Figure 5. ,, GPRv40 ,, was not identical to any of known G protein -co up led receptors, but exhibited the highest homology 
(34%) to OXYTOCIN RECEPTOR (P97926, 388 aa). Thus, M GPRv40"was concluded to be a novel G protein-coupled 



[0097] The result of BLAST SEARCH of the amino acid sequence of "GPRv47" against SWISS-PROT is shown in 
Figure 6. "GPRv47" was not identical to any of.known G protein-coupled receptors, but exhibited the highest homology 
(43%) to GPRXJDRYLA PROBABLE G PROTEIN-COUPLED RECEPTOR (Q91178, 428 aa). Thus, "GPRv47" was 
concluded to be a novel G protein-coupled receptor 
is [0098] The result of BLAST SEARCH of the amino acid sequence of "GPRv51" against SWISS-PROT is shown in 
Figure 7. "GPRv51 " was not identical to any of known G protein-coupled receptors, but exhibited the highest homology 
(37%) to PROBABLE G PROTEIN-COUPLED RECEPTOR RTA (P23749, 343 aa). Thus, "GPRv51" was concluded 
* to be a novel G protein-coupled receptor. 
[0099] The result of BLAST SEARCH of the amino acid sequence of "GPRv7i" against SWISS-PROT is shown in 
20 Figure 8. "GPRv71 M was not identical to any of known G protein-coupled receptors, but exhibited the highest homology 
(45%) to Chicken P2Y PURINOCEPTOR 3 (P2Y3) (Q98907, 328 aa). Thus, "GPRv71" was concluded to be a novel 
G protein-coupled receptor. 

[0100] The result of BLAST SEARCH of the amino acid sequence of M GPRv72" against SWISS-PROT is shown in 
Figure 9. "GPRv72" was not identical to any of known G protein-coupled receptors, but exhibited the highest homology 
25 (30%) to ALPHA- 1 A ADRENERGIC RECEPTOR (002824, 466 aa). Thus, ,, GPRv72" was concluded to be a novel G 
protein-coupled receptor. 

[Example 3] Analysis of tissue-specific expression 
30 1 . Reagents 

1.1. Primers for quantitative polymerase chain reaction (PCR) and TaqMan probes: 

[0101] Sense primers, antisense primers, and TaqMan probes were designed by using genetic analysis software 
35 "Primer Express version 1 .0" from PE Biosystems. The ordinary custom-made primers and TaqMan probes were pur- 
chased from Amersham Pharmacia Biotech (Tokyo) and PE Biosystems Japan, respectively. The TaqMan probes were 
connected with a reporter pigment FAM at the 5' end and with a quencher Tamra at the 3' end. The nucleotide sequences 
of primers and TaqMan probes are shown below. 

40 Synthetic DNA for GPRv8 



10 



receptor. 



[0102] 



45 



PCR primer 



G8.9 57F: CCAGGAGCGTTTCTATGCCT (SEQ ID NO: 37) 
G8.1082R: TGTGATCTTTGCTCCCTGCA (SEQ ID NO: 38) 



50 



TaqMan Probe 
ID NO: 39) 



GPRv8.987T: TCAGAACCTGCCAGCATTGAATAGTGCC (SEQ 



55 
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Synthetic DNA for GPRvl 2 
[0103] 

PCR primer G12.794F: ATCTGCTTTGCCCCGTATGT (SEQ ID NO: 40) 
G12.9 03R: ACCGCCTTGCTGTAGGTCAG (SEQ ID NO: 41) 

~ ~ »_ _ ^nn.,i o . Trcpcrrr TT»rr:Tr s rr cT»r: n a f cirri t n 

x av^nuj & c x wmc —«*.-»»— . — - — — - — — •— • ■< . — — _ • 

: 42) 

Synthetic DNA for GPRvl 6 
[0104] 

PCR primer G16.1133F: CCCAGCATGCATACCAGAAAA (SEQ ID NO: 43) 
G16.1254R: CTGTGTCCCTCTCATGCCAAA (SEQ ID NO: 44) 

TaqMah Probe GPRvl6 . 1 193T : TGAGAAGGCAGAGATTCCCATCCTTCCT (SE 

Q ID NO: 45) 

Synthetic DNA for GPRv21 
[0105] 

PCR primer G21-989F: TCGCCATGAGCAACAGCAT (SEQ ID NO: 46) 

G21.1114R: CACTGGACTTACCGCCATTGT (SEQ ID NO: 47) 

TaqMan Probe GPRv21 . 106 4T : AGATCATGTTGCTCCACTGGAAGGCTTCT (S 

EQ ID NO: 48) 

Synthetic DNA for GPRv40 

[0106] 

PCR primer G40.16F: GGATCTCTTTAGCCCCTCAATTC (SEQ ID NO: 49) 
G40.99R: AAGGTCAGGTTGAGACCCCAG (SEQ ID NO: 50) 

TaqMan Probe GPRv40.53T: AACATTTCCGTGCCCATCTTGCTGG (SEQ ID 

NO: 51) 
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Synthetic DNA for GPRv47 
[0107] 

5 

PCR primer G47.1292F: GCTGTTGACTTTCGAATCCCA (SEQ ID NO: 52) 

G47.1393R: ACGGAGGTAGCTGTCTGACATGA (SEQ ID NO: 53) 

10 

TaqMan Probe GPRv47 . 1336T : TGAGTTCCTGGAGCAGCAACTCACCA (SEQ 

ID NO: 54) 

15 Synthetic DNA for GPRv51 
[0108] 

20 PCR primer G51.190F: GGCTTTCGAATGCACAGGAA (SEQ ID NO: 55) 

G51.276R: GGAAGCCATGCTGAAGAGGA (SEQ ID NO: 56) 

25 TaqMan Probe GPRv51.214T: TTCTG.CATCTATATCCTCAACCTGGCGG (SEQ 

ID NO: 57) 

Synthetic DNA for GPRv71 
30 • 
[0109] 

PCR primer G71.746F: TGGCCTCTTCACCCTCTGTTT (SEQ ID NO : 58) . 
35 G71.841R: ATCAAGAGCTGGCAGTCCTGA (SEQ ID NO: 59) 

TaqMan Probe GPRv71.775T: TCCATATCACTCGCTCCTTCTACCTGACCA (S 

40 EQ ID NO: 60) 

Synthetic DNA for GPRv72 
45 [0110] 

PCR primer G72.101F: CCAAAATGCCCATCAGCCT (SEQ ID NO: 61) 
G72.19GR: GCACTATGTTGCCGACGAAA (SEQ ID NO: 62) 

50 

TaqMan Probe GPRv72.132T: CATCCGCTCAACCGTGCTGGTTATCT (SEQ I 

D NO: 63) 
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1 .2. CDNA derived from patients 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



[0111] As cDNAs which had been derived from tumor and norma! tissues from a single patient, Matched cDNA Pairs 
from Clontech were used. The tissues are lung, stomach, colon, ovary, prostate, uterus, and kidney. 
[0112] Some cDNAs derived from following tissues were purchased from BioChain Institute: brain, pancreas, and 
testis from patients with tumor and normal adults; liver from cirrhosis patients and normal adults; kidneyfrom lupus 
disease patients; and the hippocampus and frontal lobe from Alzheimer's disease (AD) patients and normal adults. 

1 .3. Reagents for quantitative PCR: 



[u i i 3] icam'viiaii Onivei'bdi r Cri ivlaoicri IWia (r i_ CiCroyotciTiS; Yv'oo liSCd iTithio C5oc*y. Tu^iCim p-oiC»!». ^wnnw. , .^^^ — ^ 

(PE Biosystems) was used for measuring the internal standard. 
2. Quantitative PCR: 

1) Dilution of template cDNA 

[0114] The cDNAs from BioChain were diluted 50 fold with water, and the cDNAs from Clontech were diluted 5 fold 
with water, for use. 

2) Preparation of Master Mix 

[01 15] A reaction solution with the following composition was prepared. 





Reaction volume 


Preparation volume 


2x Master Mix 


12.5 uJ 


1380 uJ 


Sense primer (50 uJV!) 


0.5 uJ 


55.2 uJ 


Antisense primer (50 \\M) 


0.5 uJ 


55.2 uJ 


TaqMan Probe (5 uJvl) 


ini 


110.4 uJ 


Template cDNA 


2.5 jil 




Purified water 


8uJ ' 


883.2 uJ 


Total volume 


25 uJ 


2484 \i\ 



3) Preparation of PCR solution 

[01 16] 6 uJ template cDNA solution was added to 54 uJ Master Mix solution. Then, 25-u.l aliquots of the mixture were 
added in duplicate to the sample wells of a PCR plate to be placed in a device for quantitative PCR. A 25-uJ aliquot of 
the above-mentioned Master Mix was added to each of two wells for non-template control. The standard curve was 
produced using eight 10-fold serial dilutions of cDNA which had been subcloned into pCEP4 vector, where the dilution 
started from 1 00 pg/|xl. A 25-p.l aliquot of each mixture obtained by combining 54 \x\ of Master Mix prepared in Section 
2) and 6 \l\ of each standard solution prepared above was added into a standard well. Namely, the largest amount of 
the plasmid DNA was 250 pg and the smallest was 25 ag (a: atto, 1 0" 18 ) in the standard wells. After 8-cap strips were 
placed to the top of the wells, the bubbles were removed by light centrifugation. 

4) PCR 

[0117]' The plate was placed in the device for quantitative PCR (GeneAmp 5700 Sequence Detection System: PE 
Biosystems), and then the reaction was carried out according to the following cycling program. 

(1 ) 50°C, 2 minutes: 1 cycle 

(2) 95°C, 1 0 minutes: 1 cycle 
(3) 
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95°C, 15 seconds 
60 °C, 1 minutes 



50 cycles 



10 



5)' Quantitative analysis 



[0118] The quantification was carried out according to the operation manual of GeneAmp 5700, and the result was 
outputted. 

3. Results and conclusions: 



[01 1 9] The GPCR expression profiles obtained with the cDNAs from the organs from normal human and those from 
patients with disease were represented as ratios relative to the expression level of the actin gene as an internal stand- 
's ard. The experiment was carried out in duplicate, and the average values are shown in Table 1 . 

Table 1 

0 q . relative copy number 

GPRv8 GPRv12 GPRv16 GPRv21 GPRv40 CPRv47 GPRv51 GPRv71 GPRv72 

^■.-•-^B'rBiii-.MonnalV-.v, ""^WW. • TWmMWWj ^5^7 ~ . 6 - : 1 ■ ' 0'- TjW< 0 v- 0 

•»l-^.ii^t^f^br?:^. -Ii2i&££^ ^;>r,^i J -Jt^;'dLvjLvA l.~P- 

LungNormaf 0 0 1 0 11 6-1 1 0 

Tumor 1 0 1 0 11 2 1 1 1 

V~~ TT ~Sto™cK T Norm^ 2 ~6~ r^rf—fr: b" 

PancroBS Normal n 0 0 0 0 .4 . 0. 0 ~0 0 

. Tumor " _45 2 0 0 23 2 3 4 1 

Ovary Normal" 0 0 '1 6 2 f ~~2 "l T 

3 0 Tumor 0 4 0 0 21 1 3 3 0 

"~J""^u^ : • T'TT* 3*. y 

Prostate'NorTTial "J™ O* 6" "5" \B f r *3 " Of 

_ Tumor 6 0 0 0 9 0 8 3 0. 

35 KTdniy"Norm*aT "9 0* Cf~~ 'o'"""ZS '6 27" T «f 

Tumor 9 0 0 0 28 10 15 0 0 

Lupus" 25 _ 0 1 0 1 0 31 0 

\ k/ '-^! ^cirrhosi'a. u ' : '' ■ K^fe^Sb ffe^-o'../ o :-'-o'-'. _j o 

Hfppocimpui Norm"5f° " 12* " 4* 6" "~40 TfS" 2 5 T " . 1 

40 ' „ AD 0 16 1 50 3 111 63 55 . 12 27 



[0120] When a 3-fold or more alternation in the expression level was reproducible, the difference is assessed as 
45 being significant. The cDNAs derived from the organs marked with 1 ) were purchased from BioChain; and the cDNAs 
derived from the organs without the mark were purchased from Clontech. The disease-dependent differences in the 
expression levels of the respective genes are summarized below. 

[0121] The expression of GPRv8 was undetectable in the normal pancreas and uterus, but GPRvS was expressed 
at a moderate level after canceration. GPRv8 was strongly expressed in the colon, and was more strongly expressed 
so in colon cancer. 

[0122] The expression level of GPRv12 was generally low. The expression was undetectable in the normal ovary 
and testis, but was found after canceration. The expression level decreased in the hippocampus with Alzheimer's 
disease. 

[0123] GPRv16 was expressed in the colon, but was undetectable after canceration. The expression level increased 
55 in the brain after canceration. In the liver, the expression was undetectable after cirrhosis. In the brain of Alzheimer's 
disease patients, the expression level was elevated in the hippocampus. 

[0124] The expression level of GPRv21 was low, and was undetectable in the colon and testis after canceration. 
[0125] The expression level of GPRv40 increased in the brain and testis after canceration, and decreased in the 
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liver after cirrhosis. 

[0126] The expression level of GPRv47 increased in the brain and kidney and decreased in the testis after cancer- 
ation. The expression was undetectable in the liver after cirrhosis. 

[0127] GPRv51 was strongly expressed in the colon, but the expression level decreased after canceration. The 
expression level decreased in the testis after canceration. The expression level also decreased in the liver after cirrhosis 
as compared to the normal liver. The expression level was low in the brain, but increased in the hippocampus with 
Alzheimer's disease. 

[0128] The expression level of GPRv71 decreased in the colon and kidney after canceration, and the expression 
thereof was undetectable in the liver after cirrhosis. In the patient with Alzheimer's disease, the expression level de- 
creased in the frontal lobe. 

[CI 29] Grn* 72 vvao oXpi cb&eu' oifuuQiy in iiits cuioi'i, bui Lite tsApi tsScsiun meieui w&S undeieuicibie aiifciJ U&llCfcif &UOM. 

The expression level was low in the brain, but increased in the hippocampus with Alzheimer's disease. 
[Example 4] Analysis of GPRv8 with bioinformatics 
1. Homology search of GPRv8 

[0130] The amino acid sequence of GPRv8 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release 64, httpy/www.ebi. ac.uk/), GENBANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
(Altschul, Stephen F. et al. (1 997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv8 had homology 
to the sequences shown in Table 2. Thus, GPRv8 was revealed to be a novel clone having homology to GPCR. The 
amino acid sequence of GPRv8 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-39) is shown in Table 2. 



Table 2 



Hit (ID) 


E-value 


Identities % 


Description 


AE003754 


2e-68 


43 


gene: "CG6 1 1 1 "-Drosophila melanogaster 


AF1 47743 


7e-43 


33 


vasotocin VT1 receptor-Gallus gallus 


AF1 84966 


2e-42 


33 


arginine vasotocin receptor- Platichthys flesus 


X93313 


4e-42 


36 


mesotocin receptor-giant toad 


X76321 


8e-42 


32 


vasotocin receptor-white sucker 


X87783 


4e-41 


33 * 


isotocin receptor-white sucker 


X64878 


3e-40 


. 32 


oxytocin receptor- H. sapiens 


U82440 


7e-40 


32 


oxytocin receptor- Macaca mulatta 



2. Prediction of transmembrane domain 

[0131] The amino acid sequence of GPRv8 was analyzed according to the method of Kyte-Doolittle (J. Kyte and R. 
F. Doolittle, (1982), J. Mol. Biol, 157,105-132.), for obtaining a hydropathy plot and used to predict the transmembrane 
domain. The result showed that GPRv8 had seven transmembrane domains (TM1-TM7) (Figure 10).- 

3. HMMPfam search 

[0132] Using the amino acid sequence of GPRv8 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1) ;320-322)) was carried out. The search 
was carried out with the hidden Markov model of HMMER version 2. 1 (http://hmmer.wustl.edu/) and the PFAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtmI). 

[0133] The result indicated that GPRv8 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 3. 
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Table 3 





Hit 


Score 


Expect 


Q 
from 


Q 
to 


Description 


5 


7tm_1 


164.2 


5.1e-51 


66 


330 


7 transmembrane receptor 
(rhodopsin family) 


10 


Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start position of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 



15 4. Amino acid sequence alignment 

[0134] The amino acid sequences of GPRv8 and proteins shown in Table 2 were aligned together by using Clustalw 
1 .7 (Figures 1 1 and 1 2). The result showed that GPRv8 comprise seven transmembrane domains (### ###) and Cys 
(Cys marked with M @") participating in specific S-S bonding of GPCR. 

20 

[Example 5] Analysis of GPRv12 with bio informatics 
1 . Homology search of GPRvl 2 

25 [0135] The amino acid sequence of GPRv12 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GEN BANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRvl 2 had homology 
to the sequences shown in Table 4. Thus, GPRvl 2 was revealed to be a novel clone having homology to GPCFL The 

30 amino acid sequence of GPRvt 2 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-15) is shown in Table 4. 



Table 4 



Hit (ID) 


E-value 


Identities % 


Description 


AF208288 


8e-88 ' 


50 


orphan G protein-coupled receptor GPR26-Rattus norvegicus 


L03202 


2e-17 


24. 


5-hydroxytryptamine receptor-rat 


L41146 


5e-17 


23 


5-HT6 serotonin receptor-Rattus norvegicus 


S62043 


2e-16 


25 


serotonin receptor 6-rat 


L41147 


2e-16 


24 


5-HT6 serotonin receptor-Homo sapiens 


AF134158 


4e-16 


23 


serotonin 6 receptor-Mus musculus 


L14856 


4e-16 


26 


somatostatin receptor 4-Human 


Y 14627 


5e-16 


21 


Dopamine receptor-Cyprinus carpio 


L07833 


6e-16 


26 


somatostatin receptor 4-Homo sapiens 


AF069547 


8e-16 


21 


putative odorant receptor LOR4 
Lampetra fluviatilis 



2. Prediction of transmembrane domain ^ 

[0136] The amino acid sequence of GPRvl 2 was analyzed according to the method of Kyte-Doolittle (J. Kyle and 
R. F. Doolittle, (1982), J. Mol. Biol., 157,105-132.), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRvl 2 had seven transmembrane domains (TM1 -TM7) (Figure 13). 
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3. HMMPfam search 

[0137] Using the amino acid sequence of GPRv12 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search 
5 was carried out with the hidden Markov model of HMMER version 2. 1 (http://hmmer.wustl.edu/) and the PFAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml). 

[0138] The result indicated that GPRv12 comprises tm7J (Rhodopsin family). The result of HMMPfam search is 
shown in Table 5. 

10 Table 5 



20 



Hit 


Score 


Expect 


Q from 


Qto 


Description 


7tm_1 


74.7 


7.7e-23 


22 


294 


7 transmembrane receptor 
(rhodopsin. family) 


Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start positron of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 



4. Amino acid sequence alignment 

[0139] The amino acid sequences of GPRv12 and orphan G protein-coupled receptor GPR26- Rattus norvegicus 
25 (AF208288) were aligned together by using Clustalw 1 .7 (Figure 14). The result showed that GPRv12 comprise seven 
transmembrane domains (### ###) and Cys (Cys marked with "@") participating in specific S-S bonding of GPCR. 

[Example 6] Analysis of GPRvl 6 with bio informatics 

30 1. Homology search of GPRvl 6 

[0140] The amino acid sequence of GPRvl 6 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GEN BANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
35 (Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402,). The result showed that GPRv16 had homology 
to the sequences shown in table 6. Thus, GPRvl 6 was revealed to be a novel clone having homology to GPCR. The 
amino acid sequence of GPRvl 6 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-1 8) is shown in Table 6. 



Table 6 



Hit (ID) 


E-value 


Identities % 


Description 


AF042784 


4e-20 


23 


GALANIN RECEPTOR TYPE 2-Mus musculus 


U30290 


4e-20 


27 


galanin receptor GALR1 -Rattus norvegicus 


U90657 


6e-20 


27 


GALANIN RECEPTOR TYPE 1 -mouse 


AF042782 


7e-20 


25 


galanin receptor type 2-Homo sapiens 


U94322 . 


1e«19 


24 


galanin receptor type2- Rattus norvegicus 


AF077375 


6e-19 


23 


galanin receptor type2-Mus musculus 



2. Prediction of transmembrane domain 

[0141] The amino acid sequence of GPRvl 6 was analyzed according to the method of Kyte-Doolittle (J. Kyte and 
55 R, F. Doolittle, (1982), J. Mol. Bioi, 157,105-132.), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRvl 6 had seven transmembrane domains (TM1-TM7) (Figure 15). 
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3. HMMPfam search 

[0142] Using the amino acid sequence of GPRv16 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al. f Nucleic Acids Res 1998 Jan 1; 26 (1 ):320-322)) was carried out. The search 
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtmi). 

[0143] The result indicated that GPRv16 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 7. 

Table 7 



" Hit 


Score 


Expect 


Q from 


Qto 


Description 


7tm_1 


23:8 


8.3e-7 


155 


306 


7 transmembrane receptor 
(rhodopsin family) 


7tm_1 


13.3 


0.0017 


53 


133 


7 transmembrane receptor 
(rhodopsin family) 


Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start position of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 
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4. Amino acid sequence alignment 

[0144] The result of sections 3 and 4 are indicated in Figure 16. The result showed that GPRvl 6 comprise Cys (@) 
participating in specific S-S bonding of GPCR. 
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[Example 7] Analysis of GPRv21 with bioinformatics 
1 . Homology search of GPRv21 

[0145] The amino acid sequence of GPRv21 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv21 had homology 
to the sequences shown in Table 8. Thus, GPRv21 was revealed to be a novel clone having homology to GPCR. The 
amino acid sequence of GPRv21 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-35) is shown in Table 8J ; 

Table 8 



Hit (ID) 


E-value 


Identities % 


Description 


AL121755 


0.0 


89 


G-protein coupled receptor-Human 


AF236082 


0.0 


83 


G-protein coupled receptor GPR73-Mus musculus 


M81490 


9e-37 


34 


neuropeptide receptor-D. melanogaster 


U50144 


3e-36 


30. 


type 2 neuropeptide Y receptor-Bos taurus 


U42766 


6e-36 


29 


neuropeptide y2 receptor-Human 


AF037444 


8e-36 


28 


cardioexcitatory receptor-Lymnaea stagnalis 


D86238 


8e-36 


28 


neuropeptideY-Y2 receptor-Mus musculus 


U42389 


8e-36 


29 


neuropeptide y/peptide YY receptor type 2-human 


U76254 


8e-36 


29 


neuropeptide Y receptor type 2-Human 
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2. Prediction of transmembrane domain 

[0146] The amino acid sequence of GPRv21 was analyzed according to the method of Kyte-Doo little (J. Kyte and 
R. R Doolittle, (1982), J. Mol. Biol., 157,105-132.), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRv21 had seven transmembrane domains (TM1-TM7) (Figure 17). 
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3. HMMPfam search 

[0147] Using the amino acid sequence of GPRv21 as.the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et a!., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search 
was carnea our wnn me niaaen iviarKov moaei oi" HiviiviER veisiun 2. i (iitLp;//Tiiriiher:wuoii.euu/') and Lhe F FAm Jaiauaoc 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml). 

[0148] The result indicated that GPRv21 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 9.~ 

Table 9 



Hit 



7tm 1 



Score 



188.1 



Expect 



1 .6e-58 



Q from 



79 



Q to 



338 



Description 



7 transmembrane receptor 
(rhodopsin family) 



" Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start position of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 



4. Amino acid sequence alignment 

[0149] The amino acid sequences of GPRv21 and proteins shown in Table 8 were aligned together by using Clustaiw 
1 .7 (Figures 1 8 and 1 9). The result showed that GPRv21 comprise seven transmembrane domains (### ###) and Cys 
(Cys marked with u @") participating in specific S-S bonding of GPCR. 

[Example 8] Analysis of GPRv40 with bioinformatics 

1 . Homology search of GPRv40 

[0150] The amino acid sequence of GPRv40 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release 64, httpV/www. ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
(Altschul, Stephen F: et al. (1 997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv40 had homology 
to the sequences shown in Table 1 0. Thus, GPRv40 was revealed to be a novel clone having homology to GPCR. The 
amino acid sequence of GPRv40 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-11) is shown in Table 10. 

Table 10 



Hit (ID) 


E-value 


Identities % 


Description 


D86599 


1e-13 


23 


oxytocin receptor- Mus sp. 


U 15280 


4e-13 


23 


oxytocin 23 receptor-Rattus norvegicus 


X76321 


1e-12 


22 


vasotocin receptor-white sucker 


X64878 


2e-12 


21 


oxytocin receptor-H. sapiens 


X87783 


2e-12 


21 


isotocin receptor-C.commersoni 


D45400 


3e-12 


23 


vasopressin receptor V1b-rat 


L37112 


3e-12 


24 


vasopressin receptor subtype 1b-Homo sapiens 
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Table 1 0 (continued) 



Hit (ID) 


E-value 


Identities % 


Description 


U27322 


6e-12 


23 


arginine-vasopressin V1b receptor-Rattus norvegicus 


U82440 


6e-12 


,21 


oxytocin receptor- Macaca mulatta 



2. Prediction of transmembrane domain 

[0151] The amino acid sequence of GPRv40 was analyzed according to the method of Kyte-Doolittle (J. Kyte and 
R. F. Doolittle, (1982), J. Mot. BioL t 157,105-132.), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRv40 had seven transmembrane domains (TM1-TM7) (Figure 20). 

3. HMMPfam search 

[0152] Using the amino acid sequence of GPRv40 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1 ):320-322)) was carried out. The search 
was carried out with the hidden Markov model of HMMER version 2. 1 (http://hmmer.wustl.edu/) and the P FAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml). 

[0153] The result indicated that GPRv40 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 11 . 



Table 11 



Hit 


Score 


Expect 


Q from 


Q to 


Description 


7tm_1 


26.5 


1.1e-07 


228 


352 


7 transmembrane receptor 
(rhodopsin family) 


7tm_1 


18.1 


5e-05 


59 


181 


7 transmembrane receptor 
(rhodopsin family) 


Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start position of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 



4. Amino acid sequence alignment 

[0154] The result of section 3 and 4 are indicated in Figure 21 . The result showed that GPRv40 comprise Cys (@) 
participating in specific S-S bonding. of GPCR ? 

[Example 9] Analysis of GPRv47 with bioinformatics 

1. Homology search of GPRv47 

[0155] The amino acid sequence of GPRv47 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv47 had homology 
to the sequences shown in Table 12. Thus, GPRv47 was revealed to be a novel clone having homology to GPCR. The 
amino acid sequence of GPRv47 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-11) is shown in Table 12. 



Table 1 2 



Hit (ID) 


E-value 


Identities % 


Description 


D43633 


1e-85 


41 


G protein-coupled 7-transmembrane receptor-Medaka fish 
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Table 12 (continued) 



Hit (ID) 


E-value 


Identities % 


Description 


X98133 


2e-28 


27 


histamine H2 receptor-H. sapiens 


M32701 


3e-28 


28 


histamine H2 receptor-Canine histamine 


L41147 


6e-28 


31 


5-HT6 serotonin receptor-Homo sapiens 


U25440 


8e-28 


26 


histamine H2 receptor-Cavia porcellus 


D49783 


1e-27 


28 


histamine H2 receptor- Human 


U 64032 


2e-27 


27 


alpha 1d adrenoceptor-Oryctolagus cuniculus 


S 73473 


3e-27 


28 


beta 3- adrenergic receptor- rats 


M74716 


4e-27 


28 


beta-adrenergic receptor- Rat 


S57565 


6e-27 


27 


histamine H2- receptor- rats 



2. Prediction of transmembrane domain - 

20 [0156] The amino acid sequence of GPRv47 was analyzed according to the method of Kyte-Doolittle (J. Kyte and 
R. F. Doolittle, (1982), J. Mol. Biol., 157,105-132.), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRv47 had seven transmembrane domains (TM1-TM7) (Figure 22). 

3. HMMPfam search 

25 

[0157] Using the amino acid sequence of GPRv47 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search 
was carried out with the hidden Markov model of HMMER version2.1 (http://hmmer.wustl.edu/) and the PFAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml). 
30 [0158] The result indicated that GPRv47 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 13. 



Table 13 



35 


Hit 


Score 


Expect 


Q from 


Qto 


Description 


7tm_1 


137.9 , 


9.6e-43 


59 


341 


7 transmembrane receptor 
(rhodopsin family) 


40 


Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start position of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 



45 4. Amino acid sequence alignment 



[01 59] The amino acid sequences of GPRv47 and proteins shown in Table 2 were aligned together by using Clustalw 
1 .7 (Figures 23 to 25). The result showed that GPRv47 comprise seven transmembrane domains (### ###) and Cys 
(Cys marked with "@ M ) participating in specific S-S bonding of GPCR. 

50 

[Example 10] Analysis of GPRv51 with bioinformatics 

1 . Homology search of GPRv51 

55 [0160] The amino acid sequence of GPRv51 was analyzed by searching known sequences (the known sequence 
databases are produced in .EMBL (Release 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi. 
nlrn.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv51 had homology 
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to the sequences shown in Table 14. Thus, GPRv51 was revealed to be a novel clone having homology to GPCR. The 
amino acid sequence of GPRv51 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-18) is shown in Table 14. 

Table 14 



Hit (ID) 


E-value ■ 


Identities % 


Description 


M35297 


4e-43 


36 


G-protein coupled receptor-Rat 


J03823 


1e-42 


34 


Rat mas oncogene, complete cds. 


M13150 


3e-40 


34 


mas proto-oncogene- Human 


X67735 


1e-39 


35 


Mas proto-oncogene-M.musculus mas 


AL035542 


1e-35 


36 


MAS-related Gprotein-coupled receptor MRG-Human 



2. Prediction of transmembrane domain 

[0161] The amino-acid sequence of GPRy51 was analyzed according to the method of Kyte-Doolittle (J. Kyte and 
R. F. Doolittle, (1982), J. Mol. Biof. t 157,105-132.), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRv51 had seven transmembrane domains (TM1-TM7) (Figure 26). 

3. HMMPfam search 

[0162] Using the amino acid sequence of GPRv51 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al„ Nucleic Acids Res 1998 Jan 1; 26 (1):320-322)) was carried out. The search 
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml). 

[0163] The-result indicated that GPRv51 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 15. 

Table 15 



Hit 


Score 


Expect 


Q from 


Q to 


Description 


7tm_1 


32.6 


1 .4e-09 


44 


78 


7 transmembrane receptor 












(rhodopsin family) 


7tm_1 


30.1 


8.7e-09 


104 


276 


7 transmembrane receptor 












(rhodopsin family) 


Hit: 


name of the domain deduced 


by the search. 


Score: the higher the value, the higher the reliability. 


Expext: as the value approaches 0, the reliability becomes higher. 


. Q from; the start position of the deduced domain. 


Q to: the termination position of the deduced domain. 


Description: explanation of the deduced domain. 



45 4. Amino acid sequence alignment 

[0164] The amino acid sequences of GPRv51 and G-protein coupled receptor- Rat (M35297) were aligned together 
by using Clustalw 1 .7 (Figure 27). The result showed that GPRv51 comprise seven transmembrane domains (### ###). 



[Example 11] Analysis of GPRv71 with bioinformatics 
1 . Homology search of GPRv71 

[0165] The amino acid sequence of GPRv71 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release. 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis' program (BLAST 2.0) 
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv71 had homology 
to the sequences shown in Table 16. Thus, GPRv71 was revealed to be a novel clone having homology to GPCR.- The 
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10 



15 



20 



amino acid sequence of GPRv71 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-35) is shown in Table 16. 

Table 16 



Hit (ID) 


E-value 


Identities % 


Description 


AF069555 


9e-44 


44 


G protein-coupled receptor p2y3-Meleagris gallopavo 


X98283 


9e-44 


45 


P2Y PURINOCEPTOR 3-G.domesticus 


AF031897 


6e-41 


40 


P2Y nucleotide receptor-Meleagris gallopavo 


X99953 


le-39 


41 


rz Y 1-uniiNUL.nr \ vjn o- A.iaevis 


D63665 


2e-37 


41 


novel G protein -coupled P2 receptor- Rat 


Y^14705 


1e-36 


40 


P2Y4 receptor gene-Rattus norvegicus 


AJ277752 


2e-36 


41 


P2Y4 receptor- Mus musculus 



2. Prediction of transmembrane domain . 

[0166] The amino acid sequence of GPRv71 was analyzed according to the method of Kyte-Doo little (J. Kyte and 
R. F. Doolittle, (1982), J. Moi B/o/., 157,105-132 ), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRv71 had seven transmembrane domains (TM1-TM7) (Figure 28). 



25 



30 



35 



40 



50 



55 



3. H MM Pf am search 

[0167] Using the amino acid sequence of GPRv71 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1 998 Jan 1 ; 26 (1 ) : 320-322)) was carried out. The search 
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml). 

[0168] The result indicated that GPRv71 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 17. 

Table 17 



Hit 



7tm 1 



Score 



90.6 



Expect 



7.6e-28 



Q from 



40 



Qto 



161 



Description 



7 transmembrane receptor 
(rhodopsin family) 



Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start position of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 



4. Amino acid sequence alignment 

[01 69] The amino acid sequences of GPRv71 and proteins shown in Table 2 were aligned together by using Clustalw 
1 .7 (Figures 29 and 30). The result showed that GPRv71 comprise seven transmembrane domains (### ###). 

[Example 12] Analysis of GPRv72 with bioinformatics 

1 . Homology search of GPRv72 

[0170] The amino acid sequence of GPRv72 was analyzed by searching known sequences (the known sequence 
databases are produced in EMBL (Release 64, http://www.ebi.ac.uk/), GENBANK (Release 120.0, http://www.ncbi. 
nlm.nih.gov/), and PIR (Release 66.00, http://www-nbrf.georgetown.edu/pir/)) with an analysis program (BLAST 2.0) 
(Altschul, Stephen F. et al. (1997) Nucleic Acids Res. 25:3389-3402.). The result showed that GPRv72 had homology 
to the sequences shown in Table 1 8. Thus, GPRv72 was revealed to be a novel clone having homology to GPCR. The 
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amino acid sequence of GPRv72 was analyzed by searching known sequences with an analysis program (BLAST 2.0); 
the result (data with the E-value lower than e-24) is shown in Table 1 8. 

Table 18 



Hit (ID) 


E-value 


Identities % 


Description 


AF091890 


4e-29 


32 


G-prbtein coupled receptor RE2-Homo sapiens - 


U81982 


3e-25 


30 


alpha 1a-adrenoceptor-Oryctolagus cuniculus. 


S71323 


6e-25 


32 


alpha-1 A adrenergic receptor-Japanese medaka 


D63859 


6e-25 


32 


alphal A-adrenoceptor-Oryzias latipes 


U07126 


8e-25 


29 


alphalc adrenergic receptor- Rattus norvegicus 


U03866 


8e-25 


30 


adrenergic alpha-1c receptor protein-Human 


AFO 13261 


8e-25 


30 


alpha 1 A adrenergic receptor isoform 4-Homo sapiens 


L31774 


8e-25 


30 


alpha-1 C-adrenergic receptor-Human 


D32202 


8e-25 


30* 


alpha 1C adrenergic receptor isoform 2-Human 


D32201 


8e-25 


30 


alpha 1C adrenergic receptor isoform 3-Human 


D25235 


8e-25 


30 


alphal C adrenergic receptor 



2. Prediction of transmembrane domain 



25 [0171] The amino acid sequence of GPRv72 was analyzed according to the method of Kyte-Doolittle (J. Kyte and 
R. F. Doolittle, (1 982), J. Mol. Biol., '1 57,1 05-132.), for obtaining a hydropathy plot and used to predict the transmem- 
brane domain. The result showed that GPRv72 had seven transmembrane domains (TM1-TM7) (Figure 31). 

3. HMMPfam search 

30 

[0172] Using the amino acid sequence of GPRv72 as the query, PFAM search based on the hidden Markov model 
(HMMPFAM (Sonnhammer EL, et al., Nucleic Acids Res 1998 Jan 1 ; 26 (1) :320-322)) was carried out. The search 
was carried out with the hidden Markov model of HMMER version 2.1 (http://hmmer.wustl.edu/) and the PFAM database 
of Pfam Version 5.5 (http://www.sanger.ac.uk/Software/Pfam/index.shtml). 
35 [0173] The result indicated that GPRv72 comprises tm7_1 (Rhodopsin family). The result of HMMPfam search is 
shown in Table 1 9. 



Table 19 



40 


Hit 


Score 


Expect 


Q from 


Qto 


Description 




. 7tm_1 


196.1 


4.7e-61 


48 


454 


7 transmembrane receptor 
(rhodopsin family) 


45 


Hit: name of the domain deduced by the search. 
Score: the higher the value, the higher the reliability. 
Expext: as the value approaches 0, the reliability becomes higher. 
Q from: the start position of the deduced domain. 
Q to: the termination position of the deduced domain. 
Description: explanation of the deduced domain. 



4. Amino acid sequence alignment 



[01 74] The amino acid sequences of G PRv72 and proteins shown in Table 1 8 were aligned together by using Clustalw 
1 .7 (Figures 32 to 34). The result showed that GPRv72 comprise seven transmembrane domains (### UUtt) and Cys 
(Cys marked with "@") participating in specific S-S bonding of GPCR. 
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Industrial Applicability 

[0175] The present invention provided novel G protein-coupled receptors (GPRv8, GPRv12, GPRv16, GPRv21, 
GPRv40, GPRv47, GPRv51 , GPRv71 , and GPRv72), the genes encoding the proteins, vectors containing the genes, 
5 host cells containing the vectors, and a method for producing the proteins. Further, the present invention provided a 
screening method for compounds modifying the activities of the proteins. The proteins and genes of the present in- 
vention and compounds modifying the activity of the proteins, are expected to be used for the development of new 
preventives and therapeutics for the diseases, with which the G protein-coupled receptors of the present invention are 
associated. 

10 



15 



20 



25 



30 



35 



40 



45 



50 
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SEQUENCE LISTING 

<110> HELIX RESEARCH INSTITUTE 

<120> NOVEL GUANOSINE TRIPHOSPHATE-BINDING PROTEIN-COUPLED RECEPTORS AND GENES 
THEREOF, AND PRODUCTION AND USES THEREOF 

<130> H1-113DP1PCT 

<140> 
<141> 

<150> JP 1999-375152 
<151> 1999-12-28 

<150> JP 2000-101339 
<151> 2000-03-31 

<160> 63 

<170> Patentln Ver.. 2.1 



<210> 1 

<211> 371 _ : 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
1 5 10 15 

Gin Thx Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 
20 25 30 

Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
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35 40 45 

Lys Thr Glu Gin Leu lie Thr Leu Trp Val Leu Phe Yal Phe Thr He 
50 55 60 

Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
65 . 70 75 80 

Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp Ser Phe 
85 90 95 

Thr Gly Leu Val Asn He Leu Thr Asp He Asn Trp Arg Phe Thr Gly 
100 105 110 

Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Yal Yal Arg Tyr Leu Gin 
115 120 125 

Val Val Leu Leu Tyr Ala Ser Thr Tyr Val Leu Val Ser Leu Ser He 
130 135 140 

Asp Arg Tyr His Ala He Yal Tyr Pro Met Lys Phe Leu Gin Giy Glu 
145 150 155 160 

Lys Gin Ala Arg Val Leu He Val He Ala Trp Ser Leu Ser Phe Leu 
165 170 175 

Phe Ser He Pro Thr Leu He He Phe Gly Lys Arg Thr Leu Ser Asn 
180 185 190 

Gly Glu Val Gin Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr Trp Thr 
195 200 205 

Pro Tyr Met Thr He Val Ala Phe Leu Val Tyr Phe He Pro Leu Thr 
210 215 220 

He He Ser He Met Tyr Gly. He Val He Arg Thr He Trp He Lys 
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225 230 235 240 

Ser Lys Thr Tyr Glu Thr Val lie Ser Asn Cys Ser Asp Gly Lys Leu 
245 250 255 

Cys Ser Ser Tyr Asn Arg Gly Leu lie Ser Lys Ala Lys lie Lys Ala 
260 265 270 

lie Lys Tyr Ser He lie lie He Leu Ala Phe He Cys Cys Trp Ser 
275 280 285 

Pro Tyr Phe Leu Phe Asp lie Leu Asp Asn Phe Asn Leu Leu Pro Asp 
290 295 300 

Thr Gin Glu Arg Phe Tyr Ala Ser Val lie He Gin Asn Leu Pro Ala 
305 310 315 320 

Leu Asn Ser Ala He Asn Pro Leu He Tyr Cys Val Phe Ser Ser Ser 
325 330 335 

lie Ser Phe Pro Cys Arg Glu Gin Arg Ser Gin Asp Ser Arg Met Thr 
340 345 350 

Phe Arg Glu Arg Thr Glu Arg His Glu Met Gin He Leu Ser Lys Pro 
355 360 365 

Glu Phe He 
370 



<210> 2 

<211> 363 

<212> PRT 

<213> Homo sapiens 

<400> 2 
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Met Gly Pro Gly Glu Ala Leu Leu Ala Gly Leu Leu Val Met Val Leu 
.1 5 10 15 

Ala Val Ala Leu Leu Ser Asn Ala Leu Val Leu Leu Cys Cys Ala Tyr 
20 25 . 30 

Ser Ala Glu Leu Arg Thr Arg Ala Ser Gly Val Leu Leu Val Asn Leu 
35 40 45 

Ser Leu Gly His Leu Leu Leu Ala Ala Leu Asp Met Pro Phe Thr Leu 
50 55 60 

Leu Gly Val Met Arg Gly Arg Thr Pro Ser Ala Pro Gly Ala Cys Gin 
65 70 75 80 

Val lie Gly Phe Leu Asp Thr Phe Leu Ala Ser Asn Ala Ala Leu Ser 
85 90 95 

Val Ala Ala Leu Ser Ala Asp Gin Trp Leu Ala Val Gly Phe Pro Leu 
100 105 110 

Arg Tyr Ala Gly Arg Leu Arg Pro Arg Tyr Ala Gly Leu Leu Leu Gly 
115 1 " 120 125 

Cys Ala Trp Gly Gin Ser Leu Ala Phe Ser Gly Ala Ala Leu Gly Cys 
130 135 140 

Ser Trp Leu Gly Tyr Ser Ser Ala Phe Ala Ser Cys Ser Leu Arg Leu 
145 150 155 160 

Pro Pro Glu Pro Glu Arg Pro Arg Phe Ala Ala Phe Thr Ala Thr Leu 
165 170 175 



His Ala Val Gly Phe Val Leu Pro Leu Ala Val Leu Cys Leu Thr Ser 
180 185 190 
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Leu Gin Val His Arg Yal Ala Arg Arg His Cys Gin Arg Met Asp Thr 
195 200 205 

Val Thr Met Lys Ala Leu Ala Leu Leu Ala Asp Leu His Pro Ser Val 
210 215 220 

Arg Gin Arg Cys Leu He Gin Gin Lys Arg Arg Arg His Arg Ala Thr 
225 230 235 240 

Arg Lys He Gly He Ala He Ala Thr Phe Leu lie Cys Phe Ala Pro 
245 . 250 255 

Tyr Val Met Thr Arg Leu Ala Glu Leu Val Pro Phe Val Thr Val Asn 
260 265 270 

Ala Gin Trp Gly He Leu Ser Lys Cys Leu Thr Tyr Ser Lys Ala Val 
275 280 285 

Ala Asp Pro Phe Thr Tyr Ser Leu Leu Arg Arg Pro Phe Arg Gin Val 
290 295 300 

Leu Ala Gly Met Val His Arg Leu Leu Lys Arg Thr Pro Arg Pro Ala 
305 310 315 320 

Ser Thr His Asp Ser Ser Leu Asp Val Ala Gly Met Val His Gin Leu 
325 330 335 

Leu Lys Arg Thr Pro Arg Pro Ala Ser Thr His Asn Gly Ser Val Asp 
340 345 350 

Thr Glu Asn Asp Ser Cys Leu Gin Gin Thr His 
355 360 



<210> 3 
<211> 4L9 
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<212> PRT 

<213> Homo sapiens 
<400> 3 

Met Leu Ala Ala Ala Phe Ala Asp Ser Asn Ser Ser Ser Met Asn Val 

1- 5 10 15' 

Ser Phe Ala His Leu His Phe Ala Gly Gly Tyr Leu Pro Ser Asp Ser 
20 25 30 

Gin Asp Trp Arg Thr lie lie Pro Ala Leu Leu Val Ala Val Cys Leu 

35 t 40 45 

Val Gly Phe Val Gly Asn Leu Cys Val He Gly He Leu Leu His Asn 
50 55 60 

Ala Trp Lys Gly Lys Pro Ser Met He His Ser Leu He Leu Asn Leu 
65 70 75 80 

Ser Leu Ala Asp Leu Ser Leu Leu Leu Phe Ser Ala Pro He Arg Ala 
85 90 95 

Thr Ala Tyr Ser Lys Ser Val Trp Asp Leu Gly Trp Phe Val Cys Lys 
100 105 110 

Ser Ser Asp Trp Phe He His Thr Cys Met Ala Ala Lys Ser Leu Thr 
115 ' 120 125 

He Val Val Val Ala Lys Val Cys Phe Met Tyr Ala Ser Asp Pro Ala 
130 135 140 

Lys Gin Val Ser He His Asn Tyr Thr He Trp Ser Val Leu Val Ala 
145 150 155 160 

He Trp Thr Val Ala Ser Leu Leu Pro Leu Pro Glu Trp Phe Phe Ser 
155 170 175 
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Thr He Arg His His Glu Gly Val Glu Met Cys Leu Val Asp Val Pro 
180. 185 190 

Ala Val Ala Glu Glu Phe Met Ser Met Phe Gly Lys Leu Tyr Pro Leu 
195 200 205 

Leu Ala Phe Gly Leu Pro Leu Phe Phe Ala Ser Phe Tyr Phe Trp Arg 
210 215 220 

Ala Tyr Asp Gin Cys Lys Lys Arg Gly Thr Lys Thr Gin Asn Leu Arg 
225 230 235 240 

Asn Gin lie Arg Ser Lys Gin Val Thr Val Met Leu Leu Ser lie Ala 
245 250 255 

He He Ser Ala Val Leu Trp Leu Pro Glu Trp Val Ala Trp Leu Trp 
260 265 270 

Val Trp His Leu Lys Ala Ala Gly Pro Ala Pro Pro Gin Gly Phe He 
275 280 285 

Ala Leu Ser Gin Val Leu Met Phe Ser lie Ser Ser Ala Asn Pro Leu 
290 295 300 

He Phe Leu Val Met Ser Glu Glu Phe Arg Glu Gly Leu Lys Gly Val 
305 310 315 320 

Trp Lys Trp Met He Thr Lys Lys Pro Pro Thr Val Ser Glu Ser Gin 
325 330 335 

Glu Thr Pro Ala Gly Asn Ser Glu Gly Leu Pro Asp Lys Val Pro Ser 
340 345 350 



Pro Glu Ser Pro Ala Ser lie Pro Glu Lys Glu Lys Pro Ser Ser Pro 
355 360 365 
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Ser Ser Gly Lys Gly Lys Thr Glu Lys Ala Glu lie Pro lie Leu Pro 
370 375 380 

Asp Val Glu Gin Phe Trp His Glu Arg Asp Thr Val Pro Ser Val Gin 
385 390 395 400 

Asp Asn Asp Pro He Pro Trp Glu His Glu Asp Gin Glu Thr Gly Glu 
405 410 415 

Gly Val Lys 



<210> 4 
<211> 393 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Glu Thr Thr Met Gly Phe Met Asp Asp Asn Ala Thr Asn Thr Ser 
1 5 10 15 

Thr Ser Phe Leu Ser Val Leu Asn Pro His Gly Ala His Ala Thr Ser 
20 25 30 

Phe Pro Phe Asn Phe Ser Tyr Ser Asp Tyr Asp Met Pro Leu Asp Glu 
35 40 45 

Asp Glu Asp Val Thr Asn Ser Arg Thr Phe Phe Ala Ala Lys He Val 
50 55 60 

He Gly Met Ala Leu Val Gly He Met Leu Val Cys Gly lie Gly Asn 
65 70 75 80 

Phe He Phe lie Ala Ala Leu Val Arg Tyr Lys Lys Leu Arg Asn Leu 
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85 90 95 

Thr Asn Leu Leu lie Ala Asn Leu Ala He Ser Asp Phe Leu Val Ala 
100 105 110 

He Val Cys Cys Pro Phe Glu Met Asp Tyr Tyr Val Val Arg Gin Leu 
115 . 120 125 

Ser Trp Glu His Gly His Val Leu Cys Thr Ser Val Asn Tyr Leu Arg 
130 135 140 

Thr Val Ser Leu. Tyr Val Ser Thr Asn Ala Leu Leu Ala He Ala He 
145 150 155 160 

Asp Arg Tyr Leu Ala He Val His Pro Leu Arg Pro Arg Met Lys Cys 
165 170 175 

Gin Thr Ala Thr Gly Leu He Ala Leu Val Trp Thr Val Ser He Leu 
180 185 190 

He Ala He Pro Ser Ala Tyr Phe Thr Thr Glu Thr Val Leu Val He 
195 200 205 

Val Lys Ser Gin Glu Lys He Phe Cys Gly Gin He Trp Pro Val Asp 
210 215 220 

Gin Gin Leu Tyr Tyr Lys Ser Tyr Phe Leu Phe He Phe Gly He Glu 
225 230 235 240 

Phe Val Gly Pro Val Val Thr Met Thr Leu Cys Tyr Ala Arg He Ser 
245 250 255 

Arg Glu Leu Trp Phe Lys Ala Val Pro Gly Phe Gin Thr Glu Gin He 
260 265 270 

Arg Lys Arg Leu Arg Cys Arg Arg Lys Thr Val Leu Val Leu Met Cys 
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275 280 285 

He Leu Thr Ala Tyr Val Leu Cys Trp Ala Pro Phe Tyr Gly Phe Thr 
290 295 300 

He Val Arg Asp Phe Phe Pro Thr Val Phe Val Lys Glu Lys His Tyr 
305 310 3i5 320 

Leu Thr Ala Phe Tyr He Val Glu Cys He Ala Met Ser Asn Ser Met 
325 330 335 

lie Asn Thr Leu Cys Phe Val Thr Val Lys Asn Asp Thr Val Lys Tyr 
340 345 350 

Phe Lys Lys He Met Leu Leu His Trp Lys Ala Ser Tyr Asn Gly Gly 
355 360 365 

Lys Ser Ser Ala Asp Leu Asp Leu Lys Thr He Gly Met Pro Ala Thr 
370 375 380 

Glu Glu Val Asp Cys He Arg Leu Lys 
385 390 



<210> 5 
<211> 1116 
<212> DNA 

<213> Homo sapiens 
<400> 5 

atgccagcca acttcacaga gggcagcttc gattccagtg ggaccgggca gacgctggat 60 
tcttccccag tggcttgcac tgaaacagtg acttttactg aagtggtgga aggaaaggaa 120 
tggggttcct tctactactcctttaagact gagcaattga taactctgtg ggtcctcttt 180 
gtttttacca ttgttggaaa ctccgttgtg cttttttcca catggaggag aaagaagaag 240 
tcaagaatga ccttctttgt gactcagctg gccatcacag attctttcac aggactggtc 300 
aacatcttga cagatattaa ttggcgattc actggagact tcacggcacc tgacctggtt 360 
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tgccgagtgg tccgctattt gcaggttgtg ctgctctacg cctctaccta cgtcctggtg 420 
tccctcagca tagacagata ccatgccatc gtctacccca tgaagttcct tcaaggagaa 480 
aagcaagcca gggtcctcat tgtgatcgcc tggagcctgt cttttctgtt ctccattccc 540 
accctgatca tatttgggaa gaggacactg tccaacggtg aagtgcagtg ctgggccctg 600 
tggcctgacg actcctactg gaccccatac atgaccatcg tggccttcct ggtgtacttc 660 
atccctctga caatcatcag catcatgtat ggcattgtga tccgaactat ttggattaaa 720 
agcaaaacct acgaaacagt gatttccaac tgctcagatg ggaaactgtg cagcagctat 780 
aaccgaggac tcatctcaaa ggcaaaaatc aaggctatca agtatagcat catcatcatt 840 
cttgccttca tctgctgttg gagtccatac ttcctgtttg acattttgga caatttcaac 900 
ctccttccag acacccagga gcgtttctat gcctctgtga tcattcagaa cctgccagca 960 
ttgaatagtg ccatcaaccc cctcatctac tgtgtcttca gcagctccat ctctttcccc 1020 
tgcagggagc aaagatcaca ggattccaga atgacgttcc gggagagaac tgagaggcat 1080 
gagatgcaga ttctgtccaa gccagaattc atctag 1116 

<210> 6 
<211> 1092 
<212> DNA 

<213> Homo sapiens 
<400> 6 

atgggccccg gcgaggcgct gctggcgggt ctcctggtga tggtactggc cgtggcgctg 60 
ctatccaacg cactggtgct gctttgttgc gcctacagcg ctgagctccg cactcgagcc 120 
tcaggcgtcc tcctggtgaa tctgtctctg ggccacctgc tgctggcggc gctggacatg 180 
cccttcacgc tgctcggtgt gatgcgcggg cggacaccgt cggcgcccgg cgcatgccaa 240 
gtcattggct tcctggacac cttcctggcg tccaacgcgg cgctgagcgt ggcggcgctg 300 
agcgcagacc agtggctggc agtgggcttc ccactgcgct acgccggacg cctgcgaccg 360 
cgctatgccg gcctgctg;ct gggctgtgcc tggggacagt cgctggcctt ctcaggcgct 420 
gcacttggct gctcgtggct tggctacagc agcgccttcg cgtcctgttc gctgcgcctg 480 
ccgcccgagc ctgagcgtcc gcgcttcgca gccttcaccg ccacgctcca tgccgtgggc 540 
ttcgtgctgc cgctggcggt gctctgcctc acctcgctcc aggtgcaccg ggtggcacgc 600 
agacactgcc agcgcatgga caccgtcacc atgaaggcgc tcgcgctgct cgccgacctg 660 
caccccagtg tgcggcagcg ctgcctcatc cagcagaagc ggcgccgcca ccgcgccacc 720 
aggaagattg gcattgctat tgcgaccttc ctcatctgct ttgccccgta tgtcatgacc 780 
aggctggcgg agctcgtgcc cttcgtcacc gtgaacgccc agtggggcat cctcagcaag 840 
tgcctgacct acagcaaggc ggtggccgac ccgttcacgt actctctgct ccgccggccg 900 
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ttccgccaag tcctggccgg catggtgcac cggctgctga agagaacccc gcgcccagca 960 

tccacccatg acagctctct ggatgtggcc ggcatggtgc accagctgct gaagagaacc 1020 

ccgcgcccag cgtccaccca caacggctct gtggacacag agaatgattc ctgcctgcag 1080 
cagacacact ga 1092 



<210> 7 
<211> 1260 
<212> DNA 

<213> Homo sapiens 
<400> 7 

atgctggcag ctgcctttgc agactctaac 
ctccactttg ccggagggta cctgccctct 
gctctcttgg tggctgtctg cctggtgggc 
ctccttcaca atgcttggaa aggaaagcca 
agcctggctg atctctccct cctgctgttt 
aaaagtgttt gggatctagg ctggtttgtc 
tgcatggcag ccaagagcct gacaatcgtt 
agtgacccag ccaagcaagt gagtatccac 
atctggactg tggctagcct gttacccctg 
catgaaggtg tggaaatgtg cctcgtggat 
atgtttggta agctctaccc actcctggca 
tatttctgga gagcttatga ccaatgtaaa 
aaccagatac gctcaaagca agtcacagtg 
gtcttgtggc tccccgaatg ggtagcttgg 
ccggccccac cacaaggttt catagccctg 
gcaaatcctc tcatttttct tgtgatgtcg 
tggaaatgga tgataaccaa aaaacctcca 
ggcaactcag agggtcttcc tgacaaggtt 
gaaaaagaga aacccagctc tccctcctct 
cccatccttc ctgacgtaga gcagttttgg 
gacaatgacc ctatcccctg ggaacatgaa 



tccagcagca tgaatgtgtc ctttgctcac 60 
gattcccagg actggagaac catcatcccg 120 
ttcgtgggaa acctgtgtgt gattggcatc 180 
tccatgatcc actccctgat tctgaatctc 240 
tctgcaccta tccgagctac ggcgtactcc 300 
tgcaagtcct ctgactggtt tatccacaca 360 
gtggtggcca aagtatgctt catgtatgca 420 
aactacacca tctggtcagt gctggtggcc 480 
ccggaatggt tctttagcac catcaggcat 540 
gtaccagctg tggctgaaga gtttatgtcg 600 
tttggccttc cattattttt tgccagcttt 660 
aaacgaggaa ctaagactca aaatcttaga 720 
atgctgctga gcattgccat catctctgct 780 
ctgtgggtat ggcatctgaa ggctgcaggc 840 
tctcaagtct tgatgttttc catctcttca 900 
gaagagttca gggaaggctt gaaaggtgta 960 
actgtctcag agtctcagga aacaccagct 1020 
ccatctccag aatccccagc atccatacca 1080 
ggcaaaggga aaactgagaa ggcagagatt 1140 
catgagaggg acacagtccc ttctgtacag 1200 
gatcaagaga caggggaagg tgttaaatag 1260 



<210> 8 
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<211> 1182 
<212> DNA 

<213> Homo sapiens 
<400> 8 

atggagacca ccatggggtt catggatgac aatgccacca acacttccac cagcttcctt 60 
tctgtgctca accctcatgg agcccatgcc acttccttcc cattcaactt cagctacagc 120 
gactatgata tgcctttgga tgaagatgag gatgtgacca attccaggac gttc$ttgct 180 
gccaagattg tcattgggat ggccctggtg ggcatcatgc tggtctgcgg cattggaaac 240 
ttcatcttta tcgctgccct ggtccgctac aagaaactgc gcaacctcac caacctgctc 300 
atcgccaacc tggccatctc tgacttcctg gtggccattg tctgctgccc ctttgagatg 360 
gactactatg tggtgcgcca gctctcctgg gagcacggcc acgtcctgtg cacctctgtc 420 
aactacctgc gcactgtctc tctctatgtc tccaccaatg ccctgctggc catcgccatt 480 
gacaggtatc tggctattgt ccatccgctg agaccacgga tgaagtgcca aacagccact 540 
ggcctgattg ccttggtgtg gacggtgtcc atcctgatcg ccatcccttc cgcctacttc 600 
accaccgaga cggtcctcgt cattgtcaag agccaggaaa agatcttctg cggccagatc 660 
tggcctgtgg accagcagct ctactacaag tcctacttcc tctttatctt tggcatagaa 720 
ttcgtgggcc ccgtggtcac catgaccctg tgctatgcc3 ggatctcccg ggagctctgg 780 
ttcaaggcgg tccctggatt ccagacagag cagatccgca agaggctgcg ctgccgcagg 840 
aagacggtcc tggtgctcat gtgcatcctc accgcctacg tgctatgctg ggcgcccttc 900 
tacggcttca ccatcgtgcg cgacttcttc cccaccgtgt ttgtgaagga gaagcactac 960 
ctcactgcct tctacatcgt cgagtgcatc gccatgagca acagcatgat caacactctg 102O 
tgcttcgtga ccgtcaagaa cgacaccgtc aagtacttca aaaagatcat gttgctccac 1080 
tggaaggctt cttacaatgg cggtaagtcc agtgcagacc tggacctcaa gacaattggg 1140 
atgcctgcca ccgaagaggt ggactgcatc agactaaaat aa 1182 

<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 
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<400> 9 

atgccagcca acttcacaga gggcagct 

<210> 10 
<211> 28 
<2i2> l/Na 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 10 

ctagatgaat tctggcttgg acagaatc 

<210> 11 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 11 

atgggccccg gcgaggcgct gctggcgg 

<210> 12 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 12 

tcagtgtgtc tgctgcaggc aggaatca 

<210> 13 
<211> 30 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 13 

atgctggcag ctgcctttgc agactctaac 

<210> 14 ' 
<211> 30 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 14. 

ctatttaaca ccttcccctg tctcttgatc 

<210> 15 
<211> 28 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 15 

atggagacca ccatggggtt catggatg 



<210> 16 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 16 

ttattttagt ctgatgcagt ccacctcttc 



<210> 17 

<211> 434 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Glu Asp Leu Phe Ser Pro Ser lie Leu Pro Pro Ala Pro Asn He 
1 5 10 15 

Ser Val Pro He Leu Leu Gly Trp Gly Leu Asn Leu Thr Leu Gly Gin 
20 25 30 

Gly Ala Pro Ala Ser Gly Pro Pro Ser Arg Arg Val Arg Leu Val Phe 
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35 40 45 

Leu Gly Yal He Leu Val Yal Ala Val Ala Gly Asn Thr Thr Val Leu 
50 55 60 

Cys Arg Leu Cys Gly Gly Gly Gly Pro Trp Ala Gly Pro Lys Arg Arg 
65 70 75 80 

Lys Met Asp Phe. Leu Leu Val Gin Leu Ala Leu Ala Asp Leu Tyr Ala 
85 90 95 

Cys Gly Gly Thr Ala Leu Ser Gin Leu Ala Trp Glu Leu Leu Gly Glu 
100 105 HO 

Pro Arg Ala Ala Thr Gly Asp Leu Ala Cys Arg Phe Leu Gin Leu Leu 
115 120 125 

Gin Ala Ser Gly Arg Gly Ala Ser Ala His Leu Val Val Leu He Ala 
130 135 140 

i 

Leu Glu Arg Arg Arg Ala Val Arg Leu Pro His Gly Arg Pro Leu Pro 
145 150 155 160 

Ala Arg Ala Leu Ala Ala Leu Gly Trp Leu Leu Ala Leu Leu Leu Ala 
165 170 175 

Leu Pro Pro Ala Phe Val Val Arg Gly Asp Ser Pro Ser Pro Leu Pro 
180 185 190 

Pro Pro Pro Pro Pro Thr Ser Leu Gin Pro Gly Ala Pro Pro Ala Ala 
195 200 205 

Arg Ala Trp Pro Gly Gin Arg Arg Cys His Gly He Phe Ala Pro Leu 
210 215 220 

Pro Arg Trp His Leu Gin Val Tyr Ala Phe Tyr Glu Ala Val Ala Gly 
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225 



230 



235 



240 



Phe Val Ala Pro Val Thr Val Leu Gly Val Ala Cys Gly His Leu Leu 
245 v 250 255 

Ser Val Trp Trp Arg His Arg Pro Gin Ala Pro Ala Ala Ala Ala Pro 

260 265 270 

Trp Ser Ala Ser Pro Gly Arg Ala Pro Ala Pro Ser Ala Leu Pro Arg 
275 ' 280 285 

Ala Lys Val Gin Ser Leu Lys Met Ser Leu Leu Leu Ala Leu Leu Phe 
290 295 300 

Val Gly Cys Glu Leu Pro Tyr Phe Ala Ala Arg Leu Ala Ala Ala Trp 
305 310 315 320 

Ser Ser Gly Pro Ala Gly Asp Trp Glu Gly Glu Gly Leu Ser Ala Ala 
325 330 335 

Leu Arg Val Val Ala Met Ala Asn Ser Ala Leu Asn Pro Phe Val Tyr 
340 345 350 

Leu Phe Phe Gin Ala Gly Asp Cys Arg Leu Arg Arg Gin Leu Arg Lys 
355 360 365 

Arg Leu Gly Ser Leu Cys Cys Ala Pro Gin Gly Gly Ala Glu Asp Glu 
370 375 380 

Glu Gly Pro Arg Gly His Gin Ala Leu Tyr Arg Gin Arg Trp Pro His 
385 390 395 400 

Pro His Tyr His His Ala Arg Arg Glu Pro Leu Asp Glu Gly Gly Leu 
405 410 415 



Arg Pro Pro Pro Pro Arg Pro Arg Pro Leu Pro Cys Ser Cys Glu Ser 
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420 425 430 



Ala Phe 



<210> 18 
<211> 451 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Met Glu Ser Ser Pro He Pro Gin Ser Ser Gly Asn Ser Ser Thr Leu 
1 5 10 15 

Gly Arg Val Pro Gin Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro 
20 25 30 

Glu Val Gly Leu Arg Asp Val Ala Ser Glu Ser Val Ala Leu Phe Phe 
35 40 45 

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met 
50 55 60 

Ala Val lie Ala Lys Thr Pro Ala Leu Arg Lys Phe Val Phe Val Phe 
65 70 75 80 

His Leu Cys Leu Val Asp Leu Leu Ala Ala Leu Thr Leu Met Pro Leu 
85 90 95 

Ala Met Leu Ser Ser Ser Ala Leu Phe Asp His Ala Leu Phe Gly Glu 
100 . 105 110 

Val Ala Cys Arg Leu Tyr Leu Phe Leu Ser Val Cys Phe Val Ser Leu 
115 120 125 
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Ala lie Leu Ser Val Ser Ala lie Asn Val Glu Arg Tyr Tyr Tyr Val 
130 135 140 

Val His Pro Met Arg Tyr Glu Val Arg Met Thr Leu Gly Leu Val Ala 
145 150 155 160 

Ser Val Leu Yai lily Vai Trp Vai Lys Ala Leu Aia wex Ala Ser Val 
165 170 175 

Pro Val Leu Gly Arg Val Ser Trp Glu Glu Gly Ala Pro- Ser Val Pro 
180 185 190 

Pro Gly Cys Ser Leu Gin Trp Ser His Ser Ala Tyr Cys Gin Leu Phe 
195 200 205 

Val Val Val Phe Ala Val Leu Tyr Phe Leu Leu Pro Leu Leu Leu He 
210 215 220 

Leu Val Val Tyr Cys Ser Met Phe Arg Val Ala Arg Val Ala Ala Met 
225 230 235 240 

Gin His Gly Pro Leu Pro Thr Trp Met Glu Thr Pro Arg Gin Arg Ser 
245 250 255 

Glu Ser Leu Ser; Ser Arg Ser Thr Met Val Thr Ser Ser Gly Ala Pro 
260 265 270 

Gin Thr Thr Pro His Arg Thr Phe Gly Gly Gly Lys Ala Ala Val Val 
275 280 285 

Leu Leu Ala Val Gly Gly Gin Phe Leu Leu Cys Trp Leu Pro Tyr Phe 
290 295 300 

Ser Phe His Leu Tyr Val Ala Leu Ser Ala Gin Pro He Ser Thr Gly 
305 310 315 320 
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Gin Val Glu Ser Val Val Thr Trp He Gly Tyr Phe Cys Phe Thr Ser 
325 330 335 

Asn Pro Phe Phe Tyr Gly Cys Leu Asn Arg Gin lie Arg Gly Glu Leu 
340 345 350 

Ser Lys Gin Phe Val Cys Phe Phe Lys Pro Ala Pro Glu Glu Glu Leu 
355 360 365 

Arg Leu Pro Ser Arg Glu Gly Ser He Glu Glu Asn Phe Leu Gin Phe 
370 375 380 

Leu Gin Gly Thr Gly Cys Pro Ser Glu "Ser Trp Val Ser Arg Pro Leu 
385 390 395 400 

Pro Ser Pro Lys Gin Glu Pro Pro Ala Val Asp Phe Arg He Pro Gly 
405 410 415 

Gin He Ala Glu Glu Thr Ser Glu Phe Leu Glu Gin Gin Leu Thr Ser 
420 425 430 

Asp He He Met Ser Asp Ser Tyr Leu Arg Pro Ala Ala Ser Pro Arg 
435 440 445 

Leu Glu Ser 
450 



<210> 19 

<211> 321 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Met Asn Gin Thr Leu Asn Ser Ser Gly Thr Val Glu Ser Ala Leu Asn 
1 5 10 15 
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Tyr Ser Arg Gly Ser Thr Val His Thr Ala Tyr Leu Val Leu Ser Ser 
20 25 30 

Leu Ala Met Phe Thr Cys Leu Cys Gly Met Ala Gly Asn Ser Met Val 
35 40 45 

He Trp Leu Leu Gly Phe Arg Met His Arg Asn Pro Phe Cys lie Tyr 
50 55 60 

He Leu Asn Leu Ala Ala Ala Asp Leu Leu Phe Leu Phe Ser Met Ala 
65 70 75 80 

Ser Thr Leu Ser Leu Glu Thr Gin Pro Leu Val Asn Thr Thr Asp Lys 
85 90 95 

Val His Glu Leu Met Lys Arg Leu Met Tyr Phe Ala Tyr Thr Val Gly 
100 105 110 

Leu Ser Leu Leu Thr Ala He Ser Thr Gin Arg Cys Leu Ser Val Leu 
115 120 125 

Phe Pro He Trp Phe Lys Cys His Arg Pro Arg His Leu Ser Ala Trp 
130 135 140 

Val Cys Gly Leu Leu Trp Thr Leu Cys Leu Leu Met Asn Gly Leu Thr 
145 150 155 160 

Ser Ser Phe Cys Ser Lys Phe Leu Lys Phe Asn Glu Asp Arg Cys Phe 
165 170 175 

Arg Val Asp Met Val Gin Ala Ala Leu He Met Gly Val Leu Thr Pro 
180 185 190 



Val Met Thr Leu Ser Ser Leu Thr Leu Phe Val Trp Val Arg Arg Ser 
195 200 205 
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Ser Gin Gin Trp Arg Arg Gin Pro Thr Arg Leu Phe Yal Val Yal Leu 
210 215 220 

Ala Ser Val Leu Val Phe Leu He Cys Ser Leu Pro Leu Ser He Tyr 
225 . 230 235 240 

Trp Phe Val Leu Tyr Trp Leu Ser Leu Pro Pro Glu Met Gin Yal Leu 
245 250 255 

Cys Phe Ser Leu Ser Arg Leu. Ser Ser Ser Val Ser Ser Ser Ala Asn 
260 265 270 

Pro Val He Tyr Phe Leu Val Gly Ser Arg Arg Ser His Arg Leu Pro 
275 280 285 

Thr Arg Ser Leu Gly Thr Val Leu Gin Gin Ala Leu Arg Glu Glu Pro 
290 295 300 

Glu Leu Glu Gly Gly Glu Thr Pro Thr Val Gly Thr Asn Glu Met Gly 
305 310 315 320 

Ala 



<210> 20 

<2U> 333 

<212> PRT 

<213> Homo sapiens 



<400> 20 

Met Glu Lys Yal Asp Met Asn Thr Ser Gin Glu Gin Gly Leu Cys Gin 
15 10 15 

Phe Ser Glu Lys Tyr Lys Gin Val Tyr Leu Ser Leu Ala Tyr Ser He 
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20 25 30 

He Phe lie Leu Gly Leu Pro Leu Asn Gly Thr Yal Leu Trp His Phe 
35 40 45 

Trp Gly Gin Thr Lys Arg Trp Ser Cys Ala Thr Thr Tyr Leu Val Asn 
50 55 60 

Leu Met Yal Ala Asp Leu Leu Tyr Val Leu Leu Pro Phe Leu lie He 
65 70 75 80 

Thr Tyr Ser Leu Asp Asp Arg Trp Pro Phe Gly Glu Leu Leu Cys Lys 
85 90 95 



Leu Val His Phe Leu Phe Tyr He Asn Leu Tyr Gly Ser He Leu Leu 
100 105 110 

Leu Thr Cys lie Ser Val His Gin Phe Leu Gly Val Cys His Pro Leu 
115 120 125 

Cys Ser Leu Pro Tyr Arg Thr Arg Arg His Ala Trp Leu Gly Thr Ser 
130 135 140 

Thr Thr Trp Ala Leu Val Val Leu Gin Leu Leu Pro Thr Leu Ala Phe 
145 150 155 160 

Ser His Thr Asp Tyr He Asn Gly Gin Met He Trp Tyr Asp Met Thr 
165 170 175 

Ser Gin Glu Asn Phe Asp Arg Leu Phe Ala Tyr Gly He Val Leu Thr 
180 185 190 



Leu Ser Gly Phe Leu Ser Leu Leu Gly His Phe Gly Val Leu Phe Thr 
195 200 205 



Asp Gly Gin Glu Pro Asp Gin Ala Arg Gly Glu Pro His Glu Asp Arg 
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210 215 220 

Gin His Ser Pro Ser Gin Val His Pro Asp His Pro Thr Gly Val Trp 
225 230 235 240 

Pro Leu His Pro Leu Phe Cys Ala Leu Pro Tyr His Ser Leu Leu Leu 
245 250 255 

Pro His His Leu Leu Ser Ala Phe Ser Gly Leu Pro Ala Leu Asp Gly 
260 265 270 

Ser Gin Cys Gly Leu Gin Asp Met Glu Ala Ser Gly Glu Cys Glu Gin 
275 280 285 

Leu Pro Gin Pro Ser Pro Val Leu Ser Phe Lys Gly Gly Lys Asn Arg 
290 295 300 

Val Arg Leu Leu Gin Lys Leu Arg Gin Asn Lys Leu Gly Glu His Pro 
305 310 315 320 

r 

Ala Gly Arg Lys Arg Cys Pro Gly Leu Asn Arg Ser Gly 
325 330 



<210> 21 
<211> 508 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Met Thr Ser Thr Cys Thr Asn Ser Thr Arg Glu Ser Asn Ser Ser His 
1 5 10 15 

Thr . Cys Met Pro Leu Ser Lys Met Pro He Ser Leu Ala His Gly He 
20 25 30 
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lie Arg Ser Thr Val. Leu Val lie Phe Leu Ala Ala Ser Phe Yal Gly 
35 40 45 

Asn He Val Leu Ala Leu Val Leu Gin Arg Lys Pro Gin Leu Leu Gin 
50 55 60 

Val Thr Asn Arg Phe He Phe Asn Leu Leu Val Thr Asp Leu Leu Gin 
65 70 75 80 

He Ser Leu Val Ala Pro Trp Val Val Ala Thr Ser Yal Pro Leu Phe 
85. 90 95 

Trp Pro Leu Asn Ser His Phe Cys Thr Ala Leu Val Ser Leu Thr His 
100 105 110 

Leu Phe Ala Phe Ala Ser Val Asn Thr He Val Val Val Ser Val Asp 
115 120 125 

Arg Tyr Leu Ser lie lie His Pro Leu Ser Tyr Pro Ser Lys Met Thr 
130 135 140 

Gin Arg Arg Gly Tyr Leu Leu Leu Tyr Gly Thr Trp He Val Ala lie 
145 150 155 160 

Leu Gin Ser Thr Pro Pro Leu Tyr Gly Trp Gly Gin Ala Ala Phe Asp. 

165 170 175 

Glu Arg Asn Ala Leu Cys Ser Met lie Trp Gly Ala Ser Pro Ser Tyr 
180 185 190 

Thr lie Leu Ser Val Val Ser Phe He Val He Pro Leu lie Val Met 
195 200 205 

lie Ala Cys Tyr Ser Val Val Phe Cys Ala Ala Arg Arg Gin His Ala 
210 215 220 
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Leu Leu Tyr Asn Val Lys Arg His Ser Leu Glu Val Arg Val Lys Asp 
225 230 235 / 240 

Cys Val Glu Asn Glu Asp Glu Glu Gly Ala Glu Lys Lys Glu Glu Phe 
245 250 255 

Gin Asp Glu Ser Glu Phe Arg Arg Gin His Glu Gly Glu Val Lys Ala 
260 265 270 

Lys Glu Gly Arg Met Glu Ala Lys Asp Gly Ser Leu Lys Ala Lys Glu 
275 280 285 

Gly Ser Thr Gly Thr Ser Glu Ser Ser Yal Glu Ala Arg Gly Ser Glu 
290 295 300 

Glu Val Arg Glu Ser Ser Thr Val Ala Ser Asp Gly Ser Met Glu Gly 
305 310 315 320 

Lys Glu Gly Ser Thr Lys Val Glu Glu Asn Ser Met Lys Ala Asp Lys 
325 330 335 

Gly Arg Thr Glu Val Asn Gin Cys Ser lie Asp Leu Gly Glu Asp Asp 
340 345 350 

Met Glu Phe Gly Glu Asp Asp He Asn Phe Ser Glu Asp Asp Val Glu 
355 360 365 

Ala Val Asn He Pro Glu Ser Leu Pro Pro Ser Arg Arg Asn Ser Asn 
370 375 380 

Ser Asn Pro Pro Leu Pro Arg Cys Tyr Gin Cys Lys Ala Ala Lys Val 
385 390 395 400 



He Phe He He He Phe Ser Tyr Yal Leu Ser Leu Gly Pro Tyr Cys 
405 410 415 
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Phe Leu Ala Val Leu Ala Val Trp Val Asp Val Glu Thr Gin Val Pro 
420 425 430 

Gin Trp Val He Thr He lie He Trp Leu Phe Phe Leu Gin Cys Cys 
435 440 445 

He His Pro Tyr Val Tyr Gly Tyr Met His Lys Thr He Lys Lys Glu 
450 455 460 

He Gin Asp Met Leu Lys Lys Phe Phe Cys Lys Glu Lys Pro Pro Lys 
465 470 475 480 

Glu Asp Ser His Pro Asp Leu Pro Gly Thr Glu Gly Gly Thr Glu Gly 
485 490 495 

Lys He Val Pro Ser Tyr Asp Ser Ala Thr Phe Pro 
500 505 



<210> 22 
<211> 1305 
<212> DNA 



<213> Homo 


sapiens 




<400> 22 






atggaggatc 


tctttagccc 


ctcaattctg 


ttgctgggct 


ggggtctcaa 


cctgaccttg 


agccgccgcg 


tccgcctggt 


gttcctgggg 


accacagtgc 


tgtgccgcct 


gtgcggcggc 


aagatggact 


tcctgctggt 


gcagctggcc 


gcgctgtcac 


agctggcctg 


ggaactgctg 


gcgtgccgct 


tcctgcagct 


gctgcaggca 


gtgctcatcg 


ccctcgagcg 


ccggcgcgcg 


gcgcgtgccc 


tcgccgccct 


gggctggctg 


ttcgtggtgc 


gcggggactc 


cccctcgccg 


cagccaggcg 


cgcccccggc 


cgcccgcgcc 



ccgccggcgc ccaacatttc cgtgcccatc 60 

gggcaaggag cccctgcctc tgggccgccc 120 

gtcatcctgg tggtggcggt ggcaggcaac 180 

ggcgggccct gggcgggccc caagcgtcgc 240 

ctggcggacc tgtacgcgtg cgggggcacg 300 

ggcgagcccc gcgcggccac gggggacctg 360 

tccgggcggg gcgcctcggc ccacctcgtg 420 

gtgcgtcttc cgcacggccg gccgctgccc 480 

ctggcactgc tgctggcgct gcccccggcc 540 

ctgccgccgc cgccgccgcc aacgtccctg 600 

tggccggggc agcgtcgctg ccacgggatc 660 
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ttcgcgcccc tgccgcgctg gcacctgcag gtctacgcgt tctacgaggc cgtcgcgggc 720 
ttcgtcgcgc ctgttacggt cctgggcgtc gcttgcggcc acctactctc cgtctggtgg 780 
cggcaccggc cgcaggcccc cgcggctgca gcgccctggt cggcgagccc aggtcgagcc 840 
cctgcgccca gcgcgctgcc ccgcgccaag gtgcagagcc tgaagatgag cctgctgctg 900 
gcgctgctgt tcgtgggctg cgagctgccc tactttgccg cccggctggc ggccgcgtgg 960 

10 ' tcgtccgggc ccgcgggaga ctgggaggga gagggcctgt cggcggcgct gcgcgtggtg 1020 

gcgatggcca acagcgctct caatcccttc gtctacctct tcttccaggc gggcgactgc 1080 
cggctccggc gacagctgcg gaagcggctg ggctctctgt gctgcgcgcc gcagggaggc 1140 

15 gcggaggacg aggaggggcc ccggggccac caggcgctct accgccaacg ctggccccac 1200 

cctcattatc accatgctcg gcgggaaccg ctggacgagg gcggcttgcg cccaccccct 1260 
ccgcgcccca gacccctgcc ttgctcctgc gaaagtgcct tctag 1305 

20 

<210> 23 
<211> 1356 
25 <212> DNA 

<213> Homo sapiens 

<400> 23 

atggagtcct cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 60 
caaaccccag gtccctctac tgccagtggg gtcccggagg tggggctacg ggatgttgct 120 
tcggaatctg tggccctctt cttcatgctc ctgctggact tgactgctgt ggctggcaat 180 
gccgctgtga tggccgtgat cgccaagacg cctgccctcc gaaaatttgt cttcgtcttc 240 
cacctctgcc* tggtggacct gctggctgcc ctgaccctca tgcccctggc catgctctcc 300 
agctctgccc tctttgacca cgccctcttt ggggaggtgg cctgccgcct ctacttgttt 360 
40 ctgagcgtgt gctttgtcag cctggccatc ctctcggtgt cagccatcaa tgtggagcgc 420 

tactattacg tagtccaccc catgcgctac gaggtgcgca tgacgctggg gctggtggcc 480 
tctgtgctgg tgggtgtgtg ggtgaaggcc ttggccatgg cttctgtgcc agtgttggga 540 
agggtctcct gggaggaagg agctcccagt gtccccccag gctgttcact ccagtggagc 600 
cacagtgcct actgccagct ttttgtggtg gtctttgctg tcctttactt tctgttgccc 660 
ctgctcctca tacttgtggt ctactgcagc atgttccgag tggcccgcgt ggctgccatg 720 
cagcacgggc cgctgcccac gtggatggag acaccccggc aacgctccga atctctcagc 780 
agccgctcca cgatggtcac cagctcgggg gccccccaga ccaccccaca ccggacgttt 840 
gggggaggga aagcagcagt ggttctcctg gctgtggggg gacagttcct gctctgttgg 900 
ttgccctact tctctttcca cctctatgtt gccctgagtg ctcagcccat ttcaactggg 960 
55 caggtggaga gtgtggtcac ctggattggc tacttttgct tcacttccaa ccctttcttc 1020 



30 



35 



45- 



50 



iMSDOCID: <EP 124364BA1_I_> 
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tatggatgtc tcaaccggca gatccggggg ^gagctcagca agcagtttgt ctgcttcttc 1080 
aagccagctc cagaggagga gctgaggctg cctagccggg agggctccat tgaggagaac 1140 
ttcctgcagt tccttcaggg gactggctgt ccttctgagt cctgggtttc ccgaccccta 1200 
cccagcccca agcaggagcc acctgctgtt gactttcgaa tcccaggcca gatagctgag 1260 
gagacctctg agttcctgga gcagcaactc accagcgaca tcatcatgtc agacagctac 1320 
ctccgtcctg ccgcctcacc ccggctggag tcatga 1356 

<210> 24 

<211> 966 

<212> DNA 

<213> Homo sapiens 

<400> 24 

atgaaccaga ctttgaatag cagtgggacc gtggagtcag ccctaaacta ttccagaggg 60 
agcacagtgc acacggccta cctggtgctg agctccctgg ccatgttcac ctgcctgtgc 120 
gggatggcag gcaacagcat ggtgatctgg ctgctgggct ttcgaatgca caggaacccc 180 
ttctgcatct atatcctcaa cctggcggca gccgacctcc tcttcctctt cagcatggct 240 
tccacgctca gcctggaaac ccagcccctg gtcaatacca ctgacaaggt ccacgagctg 300 
atgaagagac tgatgtactt tgcctacaca gtgggcctga gcctgctgac ggccatcagc 360 
acccagcgct gtctctctgt cctcttccct atctggttca agtgtcaccg gcccaggcac 420 
ctgtcagcct gggtgtgtgg cctgctgtgg acactctgtc tcctgatgaa cgggttgacc 480 
tcttccttct gcagcaagtt cttgaaattc aatgaagatc ggtgcttcag ggtggacatg 540 
gtccaggccg ccctcatcat gggggtctta accccagtga tgactctgtc cagcctgacc 600 
ctctttgtct gggtgcggag gagctcccag cagtggcggc ggcagcccac acggctgttc 660 
gtggtggtcc tggcctctgt cctggtgttc ctcatctgtt ccctgcctct gagcatctac 720 
tggtttgtgc tctactggtt gagcctgccg cccgagatgc aggtcctgtg cttcagcttg 780 
tcacgcctct cctcgtccgt aagcagcagc gccaaccccg tcatctactt cctggtgggc 840 
agccggagga gccacaggct gcccaccagg tccctgggga ctgtgctcca acaggcgctt 900 
cgcgaggagc ccgagctgga aggtggggag acgcccaccg tgggcaccaa tgagatgggg 960 
gcttga 966 

<210> 25 
<211> 1002 
<212> DNA 
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<213> Homo sapiens 
<400>25 

atggagaagg tggacatgaa tacatcacag gaacaaggtc tctgccagtt ctcagagaag 60 
tacaagcaag tctacctctc cctggcctac agtatcatct ttatcctagg gctgccacta 120 
aatggcactg tcttgtggca cttctggggc caaaccaagc gctggagctg tgccaccacc 180 
tatctggtga acctgatggt ggccgacctg ctttatgtgc tattgccctt cctcatcatc 240 
acctactcac tagatgacag gtggcccttc ggggagctgc tctgcaagct ggtgcacttc 300 
,ctgttctata tcaaccttta cggcagcatc ctgctgctga cctgca.tctc tgtgcaccag 360 
ttcctaggtg- tgtgccaccc actgtgttcg ctgccctacc ggacccgcag gcatgcctgg 420 
ctgggcacca gcaccacctg ggccctggtg gtcctccagc tgctgcccac actggccttc 480 
tcccacacgg actacatcaa tggccagatg atctggtatg acatgaccag ccaagagaat 540 
tttgatcggc tttttgccta cggcatagtt ctgacattgt ctggctttct ttccctcctt 600 
ggtcattttg gtgtgctatt cactgatggt caggagcctg atcaagccag aggagaacct 660 
catgaggaca ggcaacacag cccgagccag gtccatccgg accatcctac tggtgtgtgg 720 
cctcttcacc ctctgttttg tgcccttcca tatcactcgc tccttctacc tcaccatctg 780 
ctttctgctt tctcaggact gccagctctt gatggcagcc agtgtggcct acaagatatg 840 
gaggcctctg gtgagtgtga gcagctgcct caacccagtc ctgtactttc tttcaagggg 900 
ggcaaaaata gagtcaggct cctccagaaa ctgaggcaga acaagttggg tgagcatcca 960 
gctgggagga agagatgccc agggttgaac agatctgggt aa 1002 

<210> 26 
<211> 1527 
<212> DNA 
<213> Homo sapiens 

<400> 26 

atgacgtcca cctgcaccaa cagcacgcgc gagagtaaca gcagccacac 
ctctccaaaa tgcccatcag cctggcccac ggcatcatcc gctcaaccgt 
ttcctcgccg cctctttcgt cggcaacata gtgctggcgc tagtgttgca 
cagctgctgc aggtgaccaa ccgttttatc tttaacctcc tcgtcaccga 
atttcgctcg tggccccctg ggtggtggcc acctctgtgc ctctcttctg 
agccacttct gcacggccct ggttagcctc acccacctgt tcgccttcgc 
accattgtct tggtgtcagt ggatcgctac ttgtccatca tccaccctct 
tccaagatga cccagcgccg cggttacctg ctcctctatg gcacctggat 



gtgcatgccc 60 
gctggttatc 120 
gcgcaagccg 180 
cctgctgcag 240 
gcccctcaac 300 
cagcgtcaac 360 
ctcctacccg 420 
tgtggccatc 480 
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ctgcagagca ctcctccact ctacggctgg ggccaggctg cctttgatga gcgcaatgct 540 
ctctgctcca tgatctgggg ggccagcccc agctacacta ttctcagcgt ggtgtccttc 600 
atcgtcattc cactgattgt catgattgcc tgctactccg tggtgttctg tgcagcccgg 660 
aggcagcatg ctctgctgta caatgtcaag agacacagct tggaagtgcg agtcaaggac 720 
tgtgtggaga atgaggatga agagggagca gagaagaagg aggagttcca ggatgagagt 780 
gagtttcgcc gccagcatga aggtgaggtc aaggccaagg agggcagaat ggaagccaag 840 
gacggcagcc tgaaggccaa ggaaggaagc acggggacca gtgagagtag tgtagaggcc 900 
aggggcagcg aggaggtcag agagagcagc acggtggcca gcgacggcag catggagggt 960 
aaggaaggca gcaccaaagt tgaggagaac agcatgaagg cagacaaggg tcgcacagag 1020 
gtcaaccagt gcagcattga cttgggtgaa gatgacatgg agtttggtga agacgacatc 1080 
aatttcagtg aggatgacgt cgaggcagtg aacatcccgg agagcctccc acccagtcgt 1140 
cgtaacagca acagcaaccc tcctctgccc aggtgctacc agtgcaaagc tgctaaagtg 1200 
atcttcatca tcattttctc ctatgtgcta t.ccctggggc cctactgctt tttagcagtc 1260 
ctggccgtgt gggtggatgt cgaaacccag gtaccccagt gggtgatcac cataatcatc 1320 
tggcttttct tcctgcagtg ctgcatccac ccctatgtct atggctacat gcacaagacc 1380 
attaagaagg aaatccagga catgctgaag aagttcttct gcaaggaaaa gcccccgaaa 1440 
gaagatagcc acccagacct * gcccggaaca gagggtggga ctgaaggcaa gattgtccct 1500 
tcctacgatt ctgctacttt tccttga 1527 

<210> 27 
<21l> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 27 

atggaggatc tctttagccc ctcaattc 28 

<210> 28 
<211> 28 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 28 

ctagaaggca ctttcgcagg agcaaggc 

<210> 29 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 29 

atggagtcct cacccatccc ccagtcatc 

<210> 30 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 30 

tcatgactcc agccggggtg aggcggcag 
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<210> 31 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence.'an artificially 
synthesized primer sequence 

<400> 31 

atgaaccaga ctttgaatag cagtgg 

<210> 32 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenceian artificially 
synthesized primer sequence 

<400> 32 

tcaagccccc atctcattgg tgcccacg 

<210> 33 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence^an artificially 
synthesized primer sequence 

<400> 33 
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atggagaagg tggacatgaa tacatcac 

<210> 34 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence-'an artificially 
synthesized primer sequence 

<400> 34 

ttacccag;at ctgttcaacc ctgggcatc 

<210> 35 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence'.an artificially 
synthesized primer sequence 

<400> 35 ... 

atgacgtcca cctgcaccaa cagcacgc 

<210> 36 ■ 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :an artificially 
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synthesized primer sequence 
<400> 36 

tcaaggaaaa gtagcagaat cgtaggaag 

<210> 37 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 37 

ccaggagcgt ttctatgcct 

<210> 38 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 38 

tgtgatcttt gctccctgca 
<210> 39 

<211> 28 - 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequenced artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding 
<222> (1) 

<223> Label FAM (6-carboxy-f luorescein) 
<220> 

<221> misc_binding 

<222> (28) 

<223> Label TAMRA 

(6~carboxy-N, N, N* , N' -tetramethylrhodamine) 

<400> 39 

tcagaacctg ccagcattga atagtgcc 

<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence.'an artificially 
synthesized primer sequence 

<400> 40 

atctgctttg ccccgtatgt 

<210> 41 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 41 

accgccttgc tgtaggtcag 

<210> 42 

<211> 22 

<212> DNA : 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_biriding 
<222> (1) 

<223> Label FAM (6-carboxy-fluorescein) 
<220> 

<221> misc_binding 

<222> (22) 

<223> Label TAMRA 

(6-carboxy-N, N, N' , N' -tetramethylrhodamine) 

<400> 42 

tcgtgccctt cgtcaccgtg aa 

<210> 43 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 43 

cccagcatcc ataccagaaa a 

<210> 44 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 44 

ctgtgtccct ctcatgccaa a 

<210> 45 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding 
<222> (1) 
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<223> Label FAN (6-carboxy-fluorescein) 
<220> 

<221> miscjnnding 

<222> (28) 

<223> Label TAMRA . 

(6-carboxy-N, N f N' , N' -tetramethylrho dam ine) 

<400> 45 

tgagaaggca gagattccca tccttcct 

<210> 46 
<211> 1? * 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 46 

tcgccatgag caacagcat 

<210> 47 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenceran artificially 
synthesized prijner sequence 

<400> 47 

cactggactt accgccattg t 
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<210> 48 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding 
<222> (1) 

<223> Label FAM (6-carboxy^f luqrescein) 
<220> 

<221> misc_binding 

<222> (29) 

<223> Label TAMRA 

(6-carboxy-N, N, N* , N' -tetramethylrhodaraine) 

<400> 48 

agatcatgtt gctccactgg aaggcttct 

<210> 49 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 49 
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ggatctcttt agcccctcaa ttc 



<210> 50 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized primer sequence 

<400> 50 

aaggtcaggt tgagacccca g 

<210> 51 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence.'an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> miscjnnding 
<222> (1) 

<223> Label FAM (6-carboxy-f luorescein) 
<220> 

<221> misc_binding 

<222> (25) 

<223> Label TAMRA 

(6-carboxy-N, N, N' , N' -tetrajnethylrhodamine) 
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<400> 51 • 

aacatttccg tgcccatctt gctgg ' 25 



<210> 52 i 
<211> 21 "I- 
<212> DNA 4 

<213> Artificial Sequence 



<220> ; ] ^ 



<223> Description of Artificial Sequence :an artificially 
synthesized primer sequence ; j 



<400> 52 



v! 
VI 



. j ■ 

<210> 53 
<211> 23 ; 
<212> DNA \ • 

<213> Artificial Sequence 



<220> 1 

<223> Description of Artificial Sequence ran artificially 



<400> S3-i 



i 

<210> $4 



<213> Artificial Sequence 

j 
i 

<220> 



s 



gctgttgact' ttcgaatccc a 21 1 



i " . 1 



synthesized primer sequence rj £j 

■ M • ■ - a j 



? 

acggaggtag ctgtctgaca tga jj 2£ 



<211> 26 ! ' t 

<212> DNA I ! 



BEST AVAILABLE 
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<223> Description of Artificial Sequence:an artificially 
synthesized TaqMan probe sequence 

<220> ; 
<221> misc_binding 
<222> (1) 

<223> Label FAM (6-carboxy-f luorescein) 
<220> 

<221> misc_binding 

<222> (26) 

<223> Label TAMRA 

(6-carboxy-N, N, N' , Nf -tetramethylrhodamine) 

<400> 54 

tgagttcctg gagcagcaac tcacca , 

'.'"'! ■ ! 

<210> 55 
<211> 20 1 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 55 

ggctttcgaa tgcacaggaa 

<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 56 

ggaagccatg ctgaagagga 

<210> 57 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding 
<222> (1) 

<223> Label FAM (6-carboxy-f luorescein) 
<220> 

<221> misc_binding 

<222> (28) 

<223> Label TAMRA 

(6-carboxy-N, N, N' , N* -tetramethylrhodamine) 

<400> 57 

ttctgcatct atatcctcaa cctggcgg 



<210> 58 -J < ' 

<211> 21 ; I 

<212> DNA I 
<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 58 

tggcctcttc accctctett t 

<210> 59 

<211> 21 

<212> DNA . 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence :an artificially 
synthesized primer sequence 

<400> 59 

atcaagagct ggcagtcctg a 

<210> 60 
\<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding J 
<222> (1) ' 

<223> Label FAM (6-carboxy^fluorescein) 
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<220> 

<221> misc_binding 

<222X (30) 

<223> Label TAMRA 

(6-carboxy-N, N, N' , N' ~t etr amethylrhodamine) 

<400> 60 

tccatatcac tcgctccttc tacctcacca 

<210> 61 

<211> 19 

<212> DNA ' 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence:an artificially 
synthesized primer sequence 

<400> 61 

ccaaaatgcc catcagcct 

<210> 62 . 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced artificially 
synthesized primer sequence 

<400> 62 

gcactatgtt gccgacgaaa 
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<210> 63 
<211> 26 
<212> DNA . 

<213> Artificial Sequence 



10 



15 



20 



25 



30 



<220> 

<223> Description of Artificial Sequence'.an artificially 
synthesized TaqMan probe sequence 

<220> 

<221> misc_binding 
<222> (1) 

:<223> Label FAM (6-carboxy-f luorescein) 
<220> 

<221> misc_binding 

<222> (26) 

<223> Label TAMRA 

(6-carboxy-N, N, N' , N' -tetramethylrhodamine) 



<400> 63 

35 catccgctca accgtgctgg ttatct 26 



Claims 

40 

1. A DNA that encodes a guanosine triphosphate-binding protein-coupled receptor, wherein said DNA is selected 
from the group consisting of the following (a) to (d): ( " 

(a) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17 
45 to 21 ; 

(b) a DNA comprising a coding region of the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26; 

(c) a DNA encoding a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 to 4 and 17 
to 21 in which one or more amino acids are substituted, deleted, added, and/or inserted; and 

(d) a DNA hybridizing under stringent conditions to the DNA comprising the nucleotide sequence of any one 
so of SEQ ID NOs: 5 to 8 and 22 to 26. 

2. A DNA encoding a partial peptide of a protein comprising the amino acid sequence of any one of SEQ ID NOs: 1 
to 4 and 17 to 21. 

55 3. A vector comprising the DNA of any one of claims 1 and 2. 

4. A transformant carrying the DNA of any one of claims 1 and 2 or the vector of claim 3. 
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5. A protein or a peptide encoded by the.'DNA of any one of claims 1 and 2. 

6 A method for producing the protein or the peptide of claim 5, said method comprising the steps of culturing the 
transformant of claim 4 and recovering an expressed protein or peptide from the transformant or culture supernatant 
5 thereof. 

7. A method of'screening for ligands that bind to theprotein of claim 5, said method comprising the steps of: 

(a) contacting a test sample with the protein or the peptide of claim 5; and 
10 (b) selecting compounds that binds to said protein or said peptide. 

8. A method of screening for compounds that have activity of inhibiting the binding between the protein of claim 5 
and a ligand thereof, said method comprising the steps of: 

15 (a) contacting the protein of claim 5 or a partial peptide thereof with the ligand in the presence of a test sample 

and detecting a binding activity of said protein or'said partial peptide with said ligand; and 

(b) selecting compounds that reduces the binding activity detected in step (a) as compared with a binding 
activity detected in the absence of the test sample; 

20 9. A method of screening for compounds that inhibit or enhance activity of the protein of claim 5, said method com- 
prising the steps of : . 

(a) contacting a ligand of said protein with cells expressing said protein in the presence of a test sample; 

(b) detecting an alteration in the cells that results from binding of said ligand to said protein; and 

25 (c) selecting compounds that suppress or enhance the alteration detected in step (b) as compared with an 

alteration detected in the cells in the absence of the test sample; 

1 0. The method of claims 8 or 9, wherein the alteration in cells is a change in cAMP concentration or calcium concen- 
tration. 



30 



1 1 . An antibody binding to the protein of claim 5. - t 

, 12. A compound isolated by the method of any one of claim 7 to 10. 

35 13. A pharmaceutical composition comprising the compound of claim 12 as an active ingredient. 

14. The pharmaceutical composition of claim 13, wherein said pharmaceutical composition is formulated for the treat- 
ment of a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease. 

15 A polynucleotide comprising at least 15 nucleotides, wherein said polynucleotide is complementary to the DNA 
comprising the nucleotide sequence of any one of SEQ ID NOs: 5 to 8 and 22 to 26 or a complementary strand 
thereof. 

1 6. A method for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease, 
* said method comprising the steps of detecting expression of the DNA of claim 1 in tissues related to the disease 

derived from a subject, or mutation in the D'NA of claim 1 in the subject. 

1 7. An agent for diagnosing a disease selected from the group consisting of cancer, cirrhosis, and Alzheimer's disease, 
said agent comprising the antibody of claim 11 or the nucleotide of claim 15. 



40 
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>sp|P47901|V1BR_HlMAN VASOPRESSIN V1B RECEPTOR (AVPR V1B) (VASOPRESSIN V3 
RECEPTOR) (AVPR V3) (ANTIDIURETIC HORMONE RECEPTOR IB). 
Length = 424 

Score = 316 (111.2 bits), Expect = '3.7e-41, Sum P(2) = 3.7e-41 
Identities = 70/194 ( 36%), Positives = 115/194 (59X) 

Query: 56 LWVLFVFTIVBNSWLFSTWRR-KKKSRMTFFVTQLAITDSFTGLVNILTDINWRFTGDF 114 

L + V GN VL + + +K+SRM FV LA+TO L +L + Yf T F 
Sbjct: 41 LATVLVLAT6GNLAVLLTLGQLGRKRSRMHLFVLHLALTDLAVALFQVLPQLLWDITYRF 100 

Query: 115 TAPDLVCRWRYLQWLLYASTYVLVSLSIDRYHAIVYPMKFLQGEKQARVLIVIA-WSL 173 

PDL+CR V+YLQV+ ++ASTY+L+44++DRY A+ +P++ LQ 0+ L++ AWL 
Sbjct: 101 QGPDL LCRAVK YLQVLSMFASTYML L AK7L DR YLA VCHPL RSLQQPGQSTYL L I AAPWL L 160 

Query: 174 SFLFSIPTLIIFGKRTL — SNGEVQCWALWPDDSY-V/TP — YMTIVAFLVYFIPLTIISI 228 

+ +FS+P + IF R + +G + CWA D + W P Y+T ++ +P+T+++ 
Sbjct: 161 AAIFSLPQVFIFSLREVIQGSGVLDCWA-~DFGFPWGPRAYLTWTTLA!FVLPVTMLTA 217 

Query: 229 MYGIVl RTIW--IKSKT 243 

y I +K KT 
Sbjct: 218 C YSL I CHE I CKNLKVKT 234 

Score = 131 (46.1 bits), Expect = 3.7e-41, Sum P(2) = 3.7e-41 
Identities = 33/80 (41X), Positives = 47/80 (58X) 

Query: 258 SSYNRGLISKAKIKAIKYSIIIILAFICCWSPYF — LFDILDNFNLLPOTQERFYASVI 314 

SS N IS+AKI+ +K + +I+LA+I CtffP+F ++ + 0 N PD, A I 
Sbjct: 267 SSINT~ISRAKIRTVKWFVIVLAYIACWAPFFSVQMWSVWOK-NA-POEDSTNVAFTI 322 

Query: 315 IQNLPALNSAINPLIYCVFSSSI 337 

L LNS NP IY F+S + 
Sbjct: 323 SMLLGNLNSCCNPWIYHGFNSHL 345 
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Figure 2 

>sp|P31388|5H6_RAT 5-HYDROXYTRYPTAMINE 6 RECEPTOR (5-HT-6) (SEROTONIN RECEPTOR) 
(ST-B17). 
Length = 436 

Score = 224 (78.9 bits), Expect = 6.7e-17, P = 6.7e-17 
Identities = 84/309 (27%), Positives = 144/309 (46%) 

PGEA— LLAGLLVMVLAVALLSNAIVLLCCAYSAELRTRASGVLLVNLSLGHLLLAALOM 60 
PG + + A L V+++ A ++ L++L C A LR S LV+L L++ + M 
PGGSGWVAAALCWI VLTAAANSLL IVL I CTQP A-L RN-TSNFF LVSL FTSDLMVGL WM 80 

P FTL LG VMRGRTPSAPGACQV IGFLDTF L ASN AALSVAALSADQWLAVGFP LR YAGRL R- 1 19 
P +L + GR A G C + D S + L-H- +S D++L + PLRY R+ 
PPAMLNALYGRWVLARGLCLLWTAFDVMCCSASILHLCLISLDRYLLILSPLRYKLRKTA 140 

PRYAGLLLGCAWGQSLAFSGAALGCSWLGYSSAFASCSLRLPPEPERPRFAA — FTATL 1 76 
PR L+LG AW SLA AL S+L + P P + R A F 



Query: 


3 


Sbjct: 


23 


Query: 


61 


Sbjct: 


81 


Query: 


120 


Sbjct: 


141 


Query: 


177 


Sbjct: 


193 


Query: 


226 


Sbjct: 


253 


Query: 


286 


Sbjct: 


312 



-MKALALLAOLHPSVR 225 



V F LP +C T ++ AR+ ++ ++T ++ L + P + 

193 SGVTFFLPSGAICFTYCRILLAARKQAVQVASLTTGTAGQALETLQVPRTPRPGMESADS 252 



+R + R+ +A+ +GI + F + + P+ + +A+ V +.+L+ L Y 



+ +P Y L R F++ L 
ISTMNP 1 1 YPL FMROFKRAL 331 
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Figu.re 3 



>sp|P56479|GALR_M0USE 6ALANIN RECEPTOR TYPE 1 (GAL1-R) (GALR1). 
Length =348 

Score = 269 (94.7 bits), Expect = 7.9e-24, P = 7.9e-24 
Identities =82/289 (28%), Positives = 136/289 (47%) 

Query: 49 VGFVGNLCVIGI LLHNAWKGKP-SMIHSL I LNLSLADLSLLLFSAP I RATAYSKSVWDLG 107 

+G 4GN VI H + GKP S + ILNLS+ADL+ LLF P +AT Y+ W LG 
Sbict: 46 MGVLGNSLVimARSK-PGKPRSTTNLFILNLSIADLAYLLFCIPFQATVYALPTWLG 104 

Query: 108 WFVCKSSDWFIHTCMAAKSLTIVVVA--KVCFMYASDPAKQVSIHNYTIWSVLVAIVTVA 165 

F+CK. +F M T+ ++ + 4S + +4 + V IW ++ 
Sbjct: 105 AFICKFIHYFFTVSMLVSIFTLAAMSVDRYVAIVHSRRSSSLRVSRNALLGVGF-IWALS 16? 

Query: 166 SLLPLPEWFFSTIRHHEGVE-HCLVDVPAVAEEFMSMFGKLYPL — LAF6— LPLFFASF 220 

+ P + + H + + C P + K Y + FG LPL F 

Sbjct: 164 IAMASPVAYHQRLFHRDSNQTFCWEQWPN— KLHKKAYVVCTFVFGYL LPLL L ICF 217 

Query: 221 Yr^RAYDQCKKRGTKTQNLRNQIRSKQNfiVMLLSIAIIS^ 280 

+ + + K+ K + +++ K+ +L + ++ + WLP V LW . A 
Sbjct: 218 CYAKVLNHLHKK-LKNMSKKSEASKKKTAOWLWVVVFGISWLPHHVVHLWAEF— GAF 274 

Query: 281 PAPPQGFI — ALSQVLMFSISSANPLIFLVMSEEFREGLKGVWKWMITKKPPTVSESQE 337 

P P F 4 L +S SS NP+I+ +SE FR+ K V+K + + P SE++E 
Sbjct: 275 PLTPASFFFRITAHCLAySNSSVNPIIYAFLSENFRKAYKQVFKCHVCOESPR-SETKE 332 
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Figure 4 



>sp:HY2R_B0VIH-NEUR0PEPTIDE V RECEPTOR TYPE 2 (NPY2-R). 
Length = 384 

Score = 153 bits (383), Expect = 5e-37 

Identities = 93/308 (30%),. Positives = 164/308 (53%), Gaps = 7/308 (2X) 

Query: 47 DEDEDYTNSRTFFAAKIVIGMALVGIMLVCGiQNFIFIAALVRYKKLRNLTNLLIANLAI 106 

0 + ++ +S ++V+ +A I+L+ y IGN + I ++++K +R +TN IANLA+ 
Sbjct: 38 DSEPELIDSTKLIEVQVVLILAYCSIILLGVIGNSLVIHVVIKFKSMRTVTNFFIANLAV 97 

Query: 107 SDFLVAIVCCPFEMDYYWRQLSWEHGHVLCTSVNYLRTVSLYVSTNALLAIAIDRYLAI 166 

-H) LV- +C PF + Y ++ + W+ G VLC . V Y + +++ VST L IA4DR+ I 
Sbjct: 98 ADLLVNTLCLPFTLTYTLMGE — WKMGPVLCHLVPYAQGLAVQVSTITLTVIALDRHRCI 155 

Query: 167 VHPLRPRMKCQTATGLIALVWTVSILIAIPSAYFTTETVLV2VKSQEKIFCGQIWPV0QQ 226 

V+ L ++ 0+ +1 LWVS L+AP AF +++ 1+ E + C + WP +++ 
Sbjct: 156 VYHLESKISKQISFLIIGLAWGVSALLASPLAIFREYSLIEIIPDFEIVACTEKWPGEEK 215 

Query: 227 -LYYKSYFLFIFGIEFVGPvVTMTLCYARISRELWFKAVPGFQTEQIRKRLRCRRKTVLV 285 

+Y Y L I +V P+ 4+ Y RI H PG + ... +R R+KT + 
Sbjct: 216 GIYGTIYSLSSLLI LYVLPLGnSFSYTRIWSKlKNHVSPGAAHDHYHQR — -RQKTTKM 272 

Query: 286 LMCILTAYVLCWAPFYGFTIVRDFFPTVFVKEKHYLTAFYIVECIAMSNSMINTLCFVTV 345 

L+C++ + + WP + F+ 0 V + KY F + IAM++ NL + + 
Sbjct: 273 LVCWWFAVSWLPLHAFQLAVDIOSHV LDLKEYKLI FTVFHI IAMCSTFANPLLYGWM 331 

Query: 346 KNDTVKYF 353 
++ K F 

Sbjct: 332 NSNYRKAF 339 ; - ; 
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Figure 5 



>sp|P97926|OXYR_MOUSE OXYTOCIN RECEPTOR (OT-R). 
Length =388 

Score =164 (57.7 bits), Expect = 8.9e-22, Sum P(2) = 8.9e-22 
Identities = 57/166 (34%), Positives = 84/166 (50%) 

Query: 24 WGLNLTLGQGAP — ASGPPSR : RVRLVFLGVI LWAVAGNTTVLCRLCGGG 71 

V f L IG G P +GPP R RV + L +11 +A++GN VL L 
Sbjct: 9 WSI ELpLGSGVPPGAEGNLTAGPPRRNEALARVEVAVLCLl LFLALSGNACVLLAL 64 

Query: 72 GPWA6PKRRKMDFLLVQLALADLYACGGTALSGLAWELLGEPRAAT6DLACRFLQLLQAS 131 

K ++ F + L-HAOL L QL W++ R DL CR ++ LQ 

Sbjct: 65 -RTTRHKHSRLFFFMKHLSIAOLWAVFQVLPQLLWDITF—RFYGPDLLCRLVKYLQW i 21 

Query: 1 32 GRGASAHLWL I ALERRRAVRLPHGRPLPARA — LAALG-WLLALLLALPPAFV 182 

G AS +L++L++L+R A+ P . R L R LA L WL L+ ++P + 
Sbjct: 122 GMFASTYLLLLMSLDRCLAICQPL-RSLRRRTORLAVLATWLGCLVASVPQVHI 174 

Score = 155 (54.6 bits), Expect = 8.9e-22, Sum P(2) = 8.9e-22 
Identities = 49/161.(30%), Positives = 85/161 (52%) 

Query: 217 CHGIFAPLPRVmLQv7AFYEAVAGFVAPVTVLGVACGHLLS--VW--RHRPQAPAAAAP 272 

C +F + 5: W "" + Y + +A ++ PV VL AC L+S +W R + A AAAA 
Sbjct: 187 CWAVF--IQPWGPIOWVTW^^ 243 

Query: 273 WSASPG— RAPAPSALPRAKVQSLKMSLLLALLFVGCELPYFAARLAAAWS-SG 323 

S + G R + + +AK++++KM+ ++ L F+ C P+F ++ + W + ■ - 

Sbjct: 244 GSDAAGGAGRAALARVSSVKLISKAKI RTVKMTFH VLAFI VCWTPFFFVQMWSVWVNA 303 

Query : 324 PAGDWEGEGLSAALRVVAMANSALNPFVYLFFQAGDCRLRRQLRKRLGSLCCA 376 

P E A+HA N.S NP++Y+ F L +L +R LCC+ 
Sbjct: 304 PK EASAFI I AM-LLAStNSCCNPWI YMLFTG — HLFHELVQRF— LCCS 347 
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Figure 6 



>sp|Q91 178|GPRX_0RYLA PROBABLE G P ROTE IK-COUPLED RECEPTOR (FRAGMENT). 
Length = 428 

Score = 823 (289.7 bits), Expect = 9.8e-83, P = 9.8e-83 
Identities = 182/422 (43%), Positives = 266/422 (6335) 

Query: 2 ESSPIPQSSGNSSTLGRVPQTPGPSTASGVPEVGL RDVASESVALFFMLLLDLTAV 57 

+4SP+ S + S P P+ P+VG+ + + LF M+ L+L A+ 

Sbjct: 5 KTSPMITSDHSISNFSTGLFGPHPTVP — PDVGVVTSSQSQMKDLFGLFCMvTLNLIAL 61 

Query: 58 AGNAAVMAVIAKTPALRKFVFVFHLCLVDLLAALTLMPLAMLSSSALFDHALFGEVACRL 117 

N VM IA+ P L+KF FV HLC VO+L A+ LMPL ++SSS F +F + C++ 
Sbjct: 62 LANTGVMVAIARAPHLKKFAFVCHLCAVDVLCAILLMPLGIISSSPFFGTWFTILECQV 121 

Query: 118 YLFLSVCFVSLAILSVSAINVERYYYWHPMRYEVRMTLGLVASVLVGVWVKALAMASVP 177 

Y+FL+V + L+I L+++AI +VERY+Y+VHPMRYEV+MT+ IV V++ +W K+L +A V 
Sbjct: 122 YIFLNVFLIWLSILTITAISVERYFYIVHPMRYEVKMTINLVIGVMLLIWFKSLLLALVT 181 

Query: 178 VLGRVSWEEGAPSVPPGCSLQWSHSAYCQLFVWFAVLYFLLPLLLILWYCSMFRVARV 237 

+ G + + CSL SHS +F V+F V+ FL P+++I VY ++++VAR 

Sbjct: 182 LFGWPPYGHQSSIAASHCSLHASHSRLRGVFAVLFCVICFLAPWVIFSVYSAVYKVARS 241 

Query: 238 AAMQHGP-LPTWME-TP-RQRSESLSSRSTMVTSSGAPQT-TPHRTFGGGKAAWLLAVG 293 

AA+Q P +PTW + +P + RS+S++S++T++T+ PQ +P R F GGKAA+ I + 
Sbjct: 242 AALQQVPAVPTWAOASPAKORSDSI NSQTTI ITTRTLPQRLSPERAFSGGKAALTLAFI V 301 

Query: 294 GQFLLCWLPYFSFHLYVALSAQPISTGQVESVVTWIGYFCFTSNPFFYGCLNROIRGELS 353 

GQFL+CWLP+F FHL ++L+ S G +£ V W+ Y F MP FYG LNRQIR EL 
Sbjct: 302 GQFLVCWLPFF1FHLQMSLTGSMKSPG0LEEAVNWLAYSSFAVNPSFYGLLNRQIRDELV 361 

Query: 354 K-QFVCFFKPAPEEELRLPSREGSIEENFLQFLQGTGCPSESWVSRPLPSPKQ-EPPAVD 411 
. K + C +P E+ S EGS +ENFLQF+Q T SE+ S +P+ E A 

Sbjct: 362 KFRRCCVTQPV EIGPSSLEGSFQENFLOFlQRTSSSSETHPSFANSNPRrWENQA — 416 

Y 

Query: 412 FRIPGQIAEE 421 

+IPGQI EE . ! 
Sbjct: 417 HKIPGQIPEE 426^ ' ; ' 
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Figure 7 



>sp|P23749|RTA_RAT PROBABLE G PROTEIN-COUPLED RECEPTOR RTA. 






Length =. 343 




Score 


= 461 (162.3 bits). Expect = 2.3e-44. P = 2.3e-44 




Identities • 


=121/323 (37%), Positives = 178/323 (55X) 




Query: 


2 


NQTLNSSGTVESALNYSRGS-TVHT-AYL VLSSLAMFTCLCGMAGNSMVIWLLGFR 


55 






NQ G E+ YSRG T+ AL V + + + CLCG+ GN +V+W GF 




Sbjct: 


13 


NQNKMCPGMSEALELYSRGFLTIEQIATLPPPAVTNYIFLLLCLCGLVGNGLVLWFFGFS 


72 


Query: 


56 


MHRNPFCIYILNLAAADLLFLFSMASTLSLETQPLVNT-TDKVHELMKRLMYFAYTVGLS 


114 






+ R PF IY L+LA+AD ++LFS A L ++DV+ + + +G+S 




Sbjct: 


73 


IKRTPFSIYFLHLASADGIYLFSKAVIALLNMGTFLGSFPDYVRRVSRIVGLCTFFAGVS 


132 


Query: 


115 


LLTAISTQRCLSVLFPIWFKCHRPRHLSAV/VCGLLWTLCLLMNGLTSSFCSKFL— KFNE 


172 






LL AIS -+RC+SV+FP+W+ RP+ LSA VC LLW L L+ + + FC FL + + 




Sbjct: 


133 


LLPAI SI ERCVSVI FPMWYWRRRPKRLSAGVCALLWLLSFLVTSIHNYFCM-FLGHEASG 


191 


Query: 


173 


ORCFRVDMVQAALIMGVLTPVMTLSSLTLFVWVRRSSQQWRRQPTRLFWVLASVLVFLI 


232 




C +0+ L+ + P4M L L L + V +++ R++ +L WLA V VFL+ 




Sbjct: 


192 


TACLNMDISLGILLFFLFCPLMVLPCLALILHVECRARR-RQRSAKLNHWLAIVSVFLV 


250 


Query : 


233 


CSLPLSIYWFVLYWL-SLPPEMQVLCFSLSRLSSSVSSSANPVIYFLVGSRRSHRLPTRS 


291 






S+ L I WF L+W+ +P ++ L ++SSA P++YFL G +S RL 




Sbjct: 


251 


SSIYLGIDWF-LFWyFQIPAPFPEY— VTDLCICINSSAKPIVYFLAGROKSQRL-WEP 


305 


Query: 


292 


LGTVLQQALRE— EPELEGGETPTVGTNEM 319 








L V Q+ALR+ EP TP T EM 




Sbjct: 


306 


LRVVFQRALROGAEPGOAASSTPNTVTMEM 335 
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Figure 8 



> sp 1 098907 |P2Y3 CHICK P2Y PURINOCEPTOR 3 (P2Y3) (NUCLEOSIDE DIPHOSPHATE 
RECEPTOR). 
Length = 328 

Score = 452 (159.1 bits), Expect = 2.0e-43, P = 2.0e-43 
Identities = 85/185 (45X), Positives = 116/185 (62%) 

Query: 15 CQFSEKYKQVYLSLAYSII FI L6LPLNGTVLWHFWGQTKRWSCATTYLVNLMVADLLYVL 74 

C F E++KQV L L YS++F+LGLPLN V+ W K + T Y++NL +ADLLYV 
Sbjct: 13 CTFHEEFKQVLLPLVYSWFLLGLPLNAWIGQIWLARKAL7RTTIYMLNLAMA0LLYVC 72 

• Query: 75 -LPFLIITYSLDDRWPFGELLCKLVHFLFYINLYGSILLLTCISVHQFLGVCHPLCSLPY 133 
LP LI Y+ 0 WPFG+ CK V F FY HL+GSIL LTCISV +++G+CHPL S 
Sbjct: 73 SLP LL I YNYTQKDYWP FGDFTCKFVR FQFYT N LHGS I LF I.T C I SVQRYMG I CHPLASWHK 132 

Query: 134 RT-RRHAWLGTSTTV/ALWLQLLPTLAFSHTDYINGQMIWYDMTSQENFDRLFAYGIVLT 192 

+ ++ WL + W +V+ Q LPT F+ T + + YD++ + F YGI LT 
Sbjct: 133 KKGKKLTWLVCAAVWFIVIAQCLPTFVFASTGTQRNRTVCYDLSPPDRSTSYFPYGITLT 192 

Query: 193 LSGFL 197 
++6FL 

SbjctT 193 ITGFL 197 
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>sp| 002824 |A1AA_RABIT ALPHA-1A ADRENERGIC RECEPTOR (ALPHA 1 A- ADRENOCEPTOR) 
(ALPHA-1C ADRENERGIC RECEPTOR). 
Length = 466 

Score = 295 (103.8 bits), Expect = 1. Oe-31, Sum P(2) = 1 .Oe-31 
Identities = 66/215 (30%), Positives = 113/215 (52%) 

Query: 8 STRESNSSHTCMPLSKMPISLAHGMRSTVLVIFLAASFVGNIVLALVLQRKPQLLQVTN 67 

S S+SS+ P + P++++ 14 +L 4 4GNI44 L + L VT+ 
Sbjct: 5 SGNASDSSNCTHPPA — PVNI SKAI LLGVI LGGLI LFGVLGNI LVI LSVACHRHLHSVTH 62 

Query: 68 RFIFNLLVTDLLQISLVAPWWATSVPLFWPLNSHFCTALVSLTHLFAFASVNTIWVSV 127 

+1 NL V DLL S V P+ + +W FC 4+ L AS+ ++ V+S+ 
Sbjct: 63 YYI VNLAVADLLLTSTVLPFSAI FEI LGYWAFGRVFCN1WAAVDVLCCTASI ISLCVI SI ' 1 22 

Query: 128 DRYLSI1HPLSYPSKMTQRRGYLLLYGTWIVAILQSTPPLYGWGQAAFDERNALCSM1WG 187 

DRY+ + +PL YP4 +TQRRG L W +++ S PL+GW Q A D+ +C + 
Sbjct: 123 DRYIGVSYPLRYPTIVTQRRGLRALLCVWAFSLVISVGPLFGWRQPAPDDET-ICQI— N 179 

Query: 188 ASPSYTILSVVSFIVIPLIVMIACYSWFCAARRQ 222 

P Y 4 S + +PL 4+4A Y V+ A+R+ 
Sbjct: 180 EEPGYVLFSALGSFYVPLTIILAMYCRVYWAKRE 214 

Score = 106 (37.3 bits), Expect = 1. Oe-31, Sum P(2) = 1. Oe-31 
Identities = 23/75 (30*), Positives = 41/75 (54%) 

Query: 396 KAAKVIFI 1 1 FSYVLSLGPYCFLAVLAVWVDVETQVPQWVITI I IWLFFLQCCIHPYVYG 455 

KAAK 4 144 +VL P* 4 4 4 4 4 P4 V 14 WL +L CI+P +Y 
Sbjct: 259 KAAKTLGIWGCFVLCWLPFFLVMPIGSFFP-DFKPPETVFKIVFWLGYLNSCINPIIYP 327 

Query: 456 YMHKT I K KE I QDMLK 470 

4 KK Q++LK 
Sbjct: 328 CSSQEFKKAFQNVLK 342 
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Figure 10 




! J ! L ! ; ! i 

1 51 ; 101 151 201 251 301 351 

Amino acids 



BEST AVAILABLE COPY 
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lllllltl 

X64878 MECALAAN ISA-EAA-NASAAPPGAEC NR7AGPPRRNEALARVEVAVLCL I L 

U82440 WEGELAAN — WST-EAV-NSSAAPPCAEG NCTAGPPRRNEALARVEVAVLCLIL 

«t»ii iicri ri ui nrc-ci D-w^yyiJc^ucunuuccucTpnDi vpuccyAvvFyTyi ai m 



X87783 MEEHFKEQDF-ISFNESSRNSTVGNETFGG NQTVNPLKRNEEVAKVEVTVLALVL 

AF184966 MEKPGNITLHP NGSDPF GRNEEVAQ I E I UVLS 1TF 

X76321 MGR IANQT7AS r — NDTDPFCRNEEVAKIIEI TVLSVTF 

AF147743 MKNFSFPMQD-STHQTESPPHRLLSLTNKS DP VGRPERDEQLAQVE I AVLGV I F 

GPRvS IIPANFTEGSFDSSGTGQTLDSSPVACTETVTFTEVVEGKEIGSFYYSFKTEQLITLIVLF 

AE003754 - HKCDHTLFFALFQTEQFAVLI ILF 

TH1 ttttttttttt tttttttt TM2 tttttttt 

X64878 LLALSGNACVLLALRTTRQKHSRLFFFMKHLS I ADLVVAVFQVLPQLLIDITFRFYGPDL 

U82440 FLALSGNACVLLALRTTRHKHSRLFFFMKHLS I ADLVVAVFQ VLPQLLID I TFRFYGPOL 

X9331 3 FLALAGN I CVLLG I Y I NRHKHSRMYFFMKHLS I ADLVVAI FQVLPQL HDITFRFf APDL 

X87783 FLALAGNLCVL IAI YTAKHTQSRMYYLMKHLS IADLVVAVFQVLPQLIIDITFRFYGPDF 

AF 1 84966 WAV I GNVSVLLAMYNTKKKMSRMHLF I KHLSLADLVVAFFQVLPQLCIE ITYRFFGPDF 

X76 3 2 1 FVAY I GNLS VLLAMHNTKKK SS RIIHLF I KHLSLADMVVAFFQ VLFQLCIE I TFRF YGPDF 

AF147743 LTASVGNF I L I LVLWRRRKKLSRMYVFVLHLS I ADL VVAFFQVLPQL I WD I TDVF I GPDF 

GPRv8 VFT I YGNSVVLFST1R-RKKKSRHTFFVTQLA I TOSFTGLYN I LTD I NIRFTGDFTAPDL 

AEQ037S4 TV I VLGNSAVLFVMF I NKNRKS RMNYF I KQLALADLCVGLLNVLTD I IIRITI SIRAGNL 

dStfltttltl TM3 flfltlftllt tttttttttt TH4 t 

X64878 LCRLVKYLQVVGMFASTYLLLLHSLDRCLA I CQPLRSLRRRT— DRLAVUTf LGCLVAS 

U82440 LCRLVKYLQVVCMFASTYLLLLMSLDRCU I CQPLRSLRRRT— DRLAVLATILGCLVAS 

X93313 VCRLVTYLQVVGMFASTYHLLLHSLDRCLAICQPLRSLHRRS — DCVYVLFTI ILSFLLS 

X87783 LCRLVKYLQ7 VGMFASTYHLVLMS I DRC I A I CQPLRSLHKRK — DRCYV 1 VSIALSLVFS 

AF 1 84966 LCR I VKHLQVTGMFASTYMHVMMTLDRY I A I CHPLKTLQQPTQRSY IV I VSTWC3LVFS 

X7632 1 LCR I VKHLQVLGMFASTYMUVMITTLDRY I A I CHPLK7LQQPTQRAY Wl GSTILCSLLLS 

AF1 477 43 LCR 1 1 KYLQLLGUFASTYII I V VUTVDRYQAVCYPHVTFQKKRALIN I P I CTSWS ISLiLS 

GPRv8 VCRVVRYLQVVLLYASTYVLYSLSIDRYHAIVYPtlKFLQGEKQ-ARVLI VIAISLSFLFS 

AE0037 54 ACKA I RFSQVCVTYSSTYVLYAMS I DRYDA I THPMNFSKSWKR-ARHLVAGAWL I SALFS 
»: : . * *: *: ::*.:» 

tttttttttt § tttttttttt TH5 Itlllllllllll 

X64878 APQVH I FSLREVADG — VFDCIAVF I QP— IGPKAV I TI I TLAYY I VPV I VLATCYGL I S 

U82440 APQVH I FSLREVADG — VFDCIAVF i QP--IGPKAY I TI I TLAVY I VPV I VLAACYGL I S 

X93313 TPQT V I FSLTEVGHG — VYDCRADF I QP--IGPKAY I T* I TLAVY I IPVIILSVCYGLIS 

X87783 VPQVY I FSLRE I GNG — VYDCIGDFVQP — WGAKAY ITWISLTI Yl I PVA ILGGCYGL I S 

AF 1 84 9 6 6 TPQYF I FSLSEVKNGSTVKDCIAHF I EP — IGARAY I TI I TGG I FLVPVY I LVMCYGF I C 

X76321 TPQYF I FSLSE I QNGSYVYDCWGHF I EP—WG I RAY I TI I TVG I FL I PV I ILMICYGF I C 

AF147743 LPQ VF I FSK I E I SPG — I FECIAEF I QP— WGPRAYVTI I LVV I FF J PST I L ) TCQVK I C 

GPRvS IPTLI IFGK RTLSNG— EVQCIALIPDDS YITP— YMT I VAFLVYF JPLTIISI HYG1 V I 

AE003754 LP I LVLYEEKLIQGH PQCI I ELGSP t AIQV — YBSL VSATLFA I PAL I ISACYAI IV 
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Figure 12 



tffftttf TW6 

X64878 FK i WQNIRLKTAAAAAAEAPEGAAAGDGGRVALARVSSVKL I SKAK I RTVKHTF I IVLAF 

U82440 FK I VQNLRLKTAAAAAAEAPEGAAACDGGRIIALARVSSVKL I SKAK I RTVKHTF 1 1 VLAF 

XS33 1 3 YKIIQNI RLKTYCESNLRLST— SRRATLSRVSSVRL I SKAK I RTVKHTF I i VLAY 

X87783 FK I RQNFKRKTKKDQC I TLTTAA SKAHALARVSSVKLVSKAK I TTVKMTFV I VLAY 

AF1B4966 HT IHKN IKYKKRKT I PGAAS KNGL I GKNSVSSVTT I SRAKLRTVKHTFV I VLAY 

X76321 HSIHKNIKCK — TMRGTRNT KDCyiGKVSVSSVTI I SRAKLRTVKMTLV I VLAY 

AF147743 K 1 1 KRN I YVKKQNEYQVTNQ KQVLPSR ASSVNC I SKAK I KT VKMT I VTVVAY 

GPRvS RT I f 1 KSKT YETV I SNCSDG — KLCSSYNRGLISKAKIKAIKYSI 1 1 ILAF 

AE003754 KT INAKGS I FVPTERAGFGA — APARRASSRGI I PRAKVKTVKHTLT I VFVF 

IfttfilSJSSHtltS tlflfttSlfll TM7 tltlfttff 

X64B78 I VCiTPFFFVQIJISVWDANAPK, — EASAF 1 1 VHLLASLNSCCNPI I YMLFTGHLFHELV 

U82440 I VCWTPFFFVQHW5VWDANAPK — -EASAF 1 1 VVLLASLNSCCNPW I YMLFTGHLFHELV 

X93313 IveWTPFFFVQMiSVIDPNPPK — EASLF 1 I AHLLGSLNSCCNPW1 YMLFTGHLFKDLL 

X87783 I VCf TPFFFVQyiSAtDPEAPR — ^EAMPF 1 1 StfLLASLNSCCNPW I YMFFAGHLFHDLK 

AF1849S6 I I CHAPFFT VQWVS VWDENFQY ADSENTAVT I SALLASLNSCCNPI I YM I F SGHLLQOFM 

X76321 I VCWAPFF \ VQHf SYWDENFSfDDSENAA VTLSALLASLNSCCNPI I YMLFSGHLL YDFL 

AF147743 YLC1SPFF I AQLWSViFPSG I T EGSAFTI IMLLGNLNSCTNPWI YMYFCCHIPY 

GPRvB I CCISPYFLFD 1 LDNFNLLPDT-QERF Y AS V I I QNLPALNSA ! NPL I YCVFSSS I SFP— 

AEO03754 t ICISPYI I FDLLQVFGQ I PHS-QTN I A I ATF I QSLAPLNSAANPL I YCLF5SQVFRTLS 
: * m. tt u t . 

X6487B QR- FLCCSASYLKGRRLG — ETSASKKSN SSSFVLSHRSSSQRSCSQPS 

U82440 QR- FLCCSASYLKGNRLG — ETSTSKKSN SSSFVLSHRSSSQRSCSQPS 

X93313 QS FLCCSARYLKTQQQGS-DLSASRKSN SSTFVLSRKSSSQKSITQPS 

X87783 QS LLCCSTLYLKSSQCRCDQEHDSRKSN CSTYVIKSTSS-QRS ITQSS 

AF184966 NC FAICRRANADFKKED — SOSS I RRTT LLTKUTN-RSPTGSTGNtRD 

X76 32 1 RC FPCCKKPRK1ILQKED— SDSS I RRKT LLTKLAAGRMTNDGFCSIRD 

AF1 47743 : CTNKQLENTSAQ — EDSVVTGS 1 HLVD-RDPEENSTCA — 

GPRvB CREQRSQDSRMT FRERTER — HEHQILS — KP-EF 

AE003754 RFPPFKWFTCCCKSYRNNSQQNRCHTVGRRLHNSCOSMRTLTTSLTVSRRSTNKTNARVV 

t * : 

X64878 TA — — — 

U82440 7A-- : 

X93313 TA- 

X87783 IT ■ 

AF184966 LDNSPK — TS I QUE ■ : 

X76321 PCMSRKSSQS I GLDCFCKSSQCLEHDCSRKSSQC I PLDCSRKSSQCI PLDCSRKSSQCHS 

AF147743 

GPRv8 r 

AE003754 ICERPTKVVTVPAMSERRCVSLKGNTOIL - — 

XS4878 — 

U82440 • — 

X93313 — r 

X87783 — 

AF184966 — 

X76321 KES 

AF147743 — 

GPRv8 — 

AE003754 
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ftlttttttft TM1 fffftfSIUtf fftltflftftl TM2 III 

GPRv12_0RF MGPGEALLAGLLVMVLAVALLSNALVLLCCAYSAELRTRASGVLLVNLSLGHLLLAALDM 
AF208288 . MNSWDAGLAGLLVGT I CVSLLSNGLVLLCLLHSAD I RRQAPALFTLNLTCGNLLCTVVNII 

:l mui :**:'*:**:.::*. 

tttttttt illtftSIIII TM3 tiStttttitt 

GPRv 1 2_0RF . PFTLLGVMRGRTPSAPGACQV I GFLDTFLASNAALSVAALS ADQWLAVGFPLRYACRLRP 
AFZ08288 PL7LAGYVAQRQPAGDRLCRLAAFLDTFLAANSMLSMAALS I DRIVAVVFPLSYRAKURL 

tt: * t:: .**<****;*: M* * 

tnttttttt TM4 tmtttt % ttttttttt 

CPRv12J)RF RYAGLLLGCAWGQSLAFSGAALGCSWLGYSSAFASCSLRLPPEPERPRFAAFTATLHAVG 
AF208288 . RDAAFMVAYTWLHALTFPATALALSWLGFHQLYASCTLCSRRPDERLRFAVFTSAFHALS 

TU5 illMillllt M 

CPRv12J)RF FVLPLAVLCLTSLQVHRVARRHCQRIIDTVTIIKALALLADLHPSVRQRCLIQQKRRRHRAT 
AF208288 FLLSF I VLCFTYLKVLKVARFHCKR I DV I TMQTLVLLVD I HPSVRERCLEEQKRRRQRAT 

: tt*:t t:t a*:*:*.:**::*.**.*:*****:*** : ttt*t:ttt 

If If II TH6 f I ff f flff If llllllfif TM7 fflff fflff 

GPRv 1 2 J)RF . RKIGIAI ATFL I CFAP YVMTRLAELVPFVTVNAQIG I LSKCLTYSKAVADPFTYSLLRRP 
AF208288 KK I STF I GTFLVCFAPYV I TRL VELFSTAP I DSHIGVLSKCLAYSKAASDPFVYSLLRHQ 



GPRv12J)RF FRQVLAGMVHRLLKRTPRPASTHDSSLDVAGIIVHOLLKRTPRPASTHNGSVDTENDSCLQ 
AF208288, YRRSCKELLNR! FNRR S I HS VGLTGDSHSQN I LPVSE — 



GPRv12_ORF QTH 
AF208288 
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******* 
tSttSttt TMl ##•#*# 

1 MLAAAFADSN SSSMNVSFAH LHFAGGYLPS DSQDWRTIIP ALLVAVCLVG FVGNLCVIG1 60 

*M*m*********mM*****«*«**************<******************* 
II Stttt TM2 «##f # 

61 LLHNAWKGKP SHIHSLILNL SLADLSLLLF SAP I RAT AYS KSViDLGWFV CKSSD»FIHT 120 

************** **************************** 

§iiiBi tm3 utmtttt uut ; TIM ttmitu 

121 CUAAKSLTIV VVAKVCFMYA SDPAKQVSIH NYTIKSVLVA IWTVASLLPL PEWFFST I RH 180 

. T i llllttll ths ttsitiitsm 

181 MEGVEMCLVD VPAVAEEFMS MFGKLYPLLA FGLPLFFASF YFWRAYDQCK KRGTKTQNLR 240 

****************m*t*m*M************************************* 
Itttttlltl TM6 Jlllllllll ttillttt TM7 It 

241 NQ1RSKQVTV HILLS I A I.J SA VLf LPEWVA1 LNVWHLKAAG PAPPQGFIAL SQVLMFSISS 300 

****** 
ttlSfSttS 

301 ANPLIFLVUS EEFRECLKGV KKWUITKKPP TVSESQETPA GNSEGLPDKV PSPESPASIP 360 
361 EKEKPSSPSS GKGKTEKAEI PILPOVEQFW HERDTVPSVQ ONDPIPWEHE DQETGEGVK 419 
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Figure 18 



GPRv21 METTMGFNDDNATNTSTSFLSVLNPHGAHA-TSFPFN 

All 2 1755 

AF236082 — METTVGALGENTTDTFTDFFSALDGHEAQT-GSLPFT 

U42766 MGP IGAEADENQTYEEMKYEQYGP 

U76254 — -MGP IGAEAOENQTVEEUKVEQYGP 

U42389 MGP I GAEADEHQTVEEMKVEQYGP 

U501 44 : — -MKMGPLGAEADENQTVEEMKVDQFGPG 

086238 MYLKMGPVGAEADEH-QTVEVKVEPYGPG 

1181 490 MYY I AHQQPMLRNEDDNYQEGYF I RPDPASL I YNTTALPADDEGSNYGYGSTT-TLSGLQ 

AF037444 — MSUANSENSTSLFGI KRHADVTGPHSASHDVIDPSNTSVYYDHASNYESVLSTTSTLH 

tttitttttSS TM1 

GPRv21 FSYSDYDMPL DEDEDVTNSR TFFAAK I V I GMAL VG I ML VCG I GNF I F 

AL 1 2 1 755 — YGDYDLPII DEDEDKTKTR TFFAAK I V I G I ALAG I HLVCG I GNFYF 

AF236082 FS YGDYDMPL -DEEEDVTNSR TFFAAK I V I GMALVG I HLVCG I GNF I F 

U42766 QTTPRGELVP DPEPELIDST KL I E VQVVL I LAYCS 1 1 LLGV I GNSLV 

U76254 QTTPRGELVP DPEPELIDST -KL I E VQVVL I LAYCS 1 1 LLGV I GNSLV 

U42389 QTTPRGELVP DPEPELIDST KL I E VQVVL I LAYCS 1 1 LLGV I GHSLV 

U50144 HTTLPGELAP DSEPELIOST -KL I EVQVVL I LAYCS 1 1 LLGV I GNSLV 

D8S238 HTTPRGELPP DPEPELIDST -KLVEVQV I L I LAYCS 1 1 L LGVVGNSLV 

U81490 FETYNI TVMMNFSCDDYDLLSEDMV SSAYFK 1 1 V YIILY I P I F I FAL I GNGTV 

AF037444 LKLTDLVTPFNASEPDPESNGSDTDGGHAA ISEQPMYAKV 1 1 VLUYVL 1 1 LVAVGGNLLF 

y % i m . 

IftflfTl ttittU TM2 tlitittt gfttttt 

GPRv21 IAALVRYKKLRNLTNLLIANLAISDFLVAI VCCPFEMDYYVVRQLSIEHGHVLCTSVNYL 

ALT 21755 I AALTR YKKLRNLTNLL I ANLA I SDFLVA I ICCPFEMDYYVVRQLSIEHGHVLCASVNYL 

AF236082 ITALARYKKLRNLTNLLIANLAISDFLVAI VCCPFEMDYYVVRQLSIEHGHVLCASVNYL 

U4276G I HVV I KFKSMRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG — EIKHGPVLCHLVPYA 

U76254 I HVV I KFKSHRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG — EWKMGPVLCHLVPYA 

U42389 IHVVI KFKSMRTVTNFF IANLAVADLLVNTLCLPFTLTYTLHG — EIKHGPVLCHLVPYA 

U50144 IHVVI KFKSMRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLMG — EIKHGPVLCHLVPYA 

088238 I HVV I KFKSMRTVTNFF I ANLAVADLLVNTLCLPFTLTYTLHG— EIKHGPVLCHLVPYA 

H81490 CY I VYSTPRMRTVTNYF I ASLAIGO I LMSFFCEPSSF ISLFI LN-YIPFGLALCHFVNYS 

AF037444 S YV I VHYPKHRS VTNLFLLNLAI SD I VKAV I CNPFAF IANLI LL-YIPYGEFMCQVVTY I 
: :*. :** :: . * * * * :* * * 

It TH3 tttttttttt tUiMiit TH4 ttSttttttt 

CPRv 2 1 RTVSLY VSTNALLA I A I DRYLA I VHPLRPRMKCQTATGL I ALVWTVS I L I A I PSAYFTTE 

AL12175S RTVSLY VSTNALLA I A I DRYLA I VHPLKPRMNYQTASFL I ALVIMVS I L I A I PSAVFATE 

AF236082 RTVSLYVSTNALLA I A I DRYLA I VHPLRPRHKCQTAAGL I FLVf SVS I L I A I PAAYFTTE 

U42766 QGLAVQVST I TLT V I ALDRHRC I VYHLESK I SKR I SFL 1 1 GLAIG I SALLASPLA I FREY 

U7S254 QGLAVQVST I TLTVI ALDRHRC I VYHLESK I SKR ISFLI I GLAIG I SALLASPLA I FREY 

U42 389 QGLAVQVST I TLTV I ALDRHRC I VYHLESK I SKR I SFL 1 1 GLGIR I SALLASPLA I FREY 

U 50 1 44 QGLAVQVST I TLT V I ALDRHRC I V YHLESK ISKQIS FL 1 1 GLAIG VSA LL AS PLA I FREY 

D86238 QGLAVQVST I TLT V I ALDRHRC I VYHLESK I SKR I SFL 1 1 GLAIG I SALLASPLA I FREY 

M8 1 490 QAVSVLVSA YTL VA I S I DR Y I A I MWPLKPR I TKRYATF 1 1 AGVIF I ALATALP I P I VSGL 

AF037444 QVVAVFLS AFTLVAMSVDR Y VA I LKPMRPRLSKRAFA I THAT 1 1 1 LSLSAPLPTA I TSRV 
: ::: :*: :» .*: : t :: . t . 
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§ ttimttt TH5 JINIIIIMIII 

GPRv21 TVLV I VKSQ — EK IFCGQ I KPVDQQ-LY YKSYFLF I FG I EFVGPVVTMTLCYAR I SRELW 

AL1 2 1 755 TVLF I VKSQ — EK I FCGQ I WPVDQQ-LYYKSYFLF I FGVEFVGPYVTMTLCVAR I SRELI 

AF236082 TVLV I VERQ--EK I FCGQ I WPVDQQ-FY YRSYFLL VFGLEFVGPVVAMTLCYARVSRELI 

■ «i ici iDne_PIVACTFKVPRF£ICStYftTVV$j SSM II YVI PI 0 1 1 SFS YTR I fSKLK 

U76254 SL'I E 1 1 PDF — i i VACTEKIPGeIkS I YGTVYSLSSLL I LYVLPLG 1 I SFSYTR I VSKLK 

U42389 SL I E 1 1 PDF— E I VPCTEKIPAEEKS I YGTVYSLSSLL I LYVLP1GI I SFSYTR I ISKLK 

0501 44 SL I E 1 1 PDF — E I VACTEKIPGEEKG I YGT I YSLSSLL I LYVLPLG 1 1 SFSYTR I HSKLK 

D86238 SLIEI I PDF— E I VACTEKIPGEEKSVYGTVYSLSTLL I LYVLPLG II SFSYTR IISKLR 

M81490 D I PMSPIHTKCEKY I CREMIPSRSQ — E YYYTLSLFALQFVVPLGVLI FTYAR ITIRVI 

AF037444 TKQSNSTGL CLEHFENDHN— R Y I YS I V I MHLQYFVPLAV I TVTNTHI GY I VI 

# • ■ • • • • 

tttttlttl 1MB 99$ tUt 

GPRv21 FKAVPG-FQTEQI RKRLRCRRKTVLVLMCI LTAY VLCiAPFYGFT I VROFFPTVFVKEKH 

AL1 21755 FKAVPG-FQTEQI RKRLRCRRKTVLVLUC I LTAY-VLCBAPFVGFT I VROFFPTVFVKEKH 

AF236082 FKAVPG-FQTEQIRRTVRCRRRTVLGLVCVLSAYVLCIAPFYGRIVRDFFPSVFVKEKH 

U427SS NHVSPG-AANDHYHQR — RQKTTKHLVCVVVVFAVSWLPLHAFQUVD-IDSQVLDLKE 

U76254 SHVSPG-AANDHYHQR — RQKTTKHLVCVVVVFAVSILPLHAFQLAVD-IDSQVLDLKE 

U42389 NHVSPG-AANDHYHQR RQKTTKNLVCVVVVFAVSILPLHAFQLAVD-IOSQVLDLKE 

U50144 NHVS PG-AAHDHYHQR — RQKTTKNLVCVVVVFAVSWLPLHAFQLAVD-IDSHVLDLKE 

D86238 NHVSPG-AASDHYHQR — RHKMTKNLVC VVV VFAVSWLPLHAFQLAVD- 1 DSHVLDLKE 

U81490 AKRPPGEAETNRDQRNARSKRKHVKHMLTVVIVFTCCiLPFNILQLLLN— DEEFAHIDP 

AF037444 I KKTPGEAEEDRDRRHAASKRRL VKM 1 1 1 VVV I YAVC1LPVHV ITLVGD-HNPD I YNQPH 
: « :: :: :::.:::::..«♦.::: 

tiittSff TM7 IMIlilll 

GPRv21 YLTAFY I VEC I AMSNSH I NTLCF VTVKNDT VKYFKK I ML— LHIKASYNGGKS 

AL 1 21755 YLTAFYVVEC I AMSNSH I NTVCFVTVKNNTMKYFKKMML LHMRPSQRGSKS 

AF23BD82 YLTAFYVVEC I AMSNSH I NTLCFVTVRNNTSK YLKR I LR — — LQMRASPSGSKA 

U4276B YKLIFTVFHI IAMCSTFANPLLYGW1INSNYRKAFLSAFR CEQRLDAIHSEV 

U76254 YKLIFTVFYI IAHCSTFANPLLYGWHNSNYRKAFLSAFR— ^— CEQRLDAIHSEV 

U42389 YKLIFTVFHI lAMCSTFANPLLYGWMNSNYRKAFLSAFR CEQRLDAIHSEV 

U50144 YKLIFTVFHI lAMCSTFANPLLYMMNSNYRKAFLSAFR CEQRLDAIHSEV 

086238 YKLIFTVFHI IAMCSTFANPLLYGWNSNYRKAFLSAFR CEQRLOAIHSEV 

M81490 LPYVIFAFHILAMSHCCYNP 1 1 YCYMNARFRSGFVQLMHRHPGLRRWCCLRSVGDRMNAT 

AF037444 MNVVWLCAQWLAHSHSCYNPFVYFSLSATFRRNLRRMTHACRLKQKR-LRQHLSMRSSRA 

GPRv21 S ADLDLKT I GM — -PATEEVDC I RLK 

AL121755 S- ADLDLRTNGV — PTTEEVDC I RLK 

AF236D82 S-^- — -ADLDLRTTG I — PATEEVDC I RLK 

U42766 SVTFKAK- — —KNLEVRKNSG — PNDSFTEATNV— 

U76254 SVTFKAK KNLEVRKNSG — PNDSFTEATNV 

U42389 SVTFKAK KNLEVRKNSG — PNDSFTEATNV 

U50144 SVTFKAK KHLQVTKNNG— PNDSFTETTNV • : 

086238 SMTFKAK KNLEVKKNNG — PTDSFSEATNV- ■ 

M8 1 490 SGTGPALPLN — RMNTSTTY I SARRKPRATSLRANPLSCGETS PLR 

AF037444 DAWDRDTEVYGSAES I PSKVSAGSLHSSNRGAKHVNTSSGEWQCLKEKKLKGVSNDMYL 
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Figure 21 



** 

ttttttt Till ft 

1 MEDLFSPSIL PPAPNISVPI LLGWGLNLTL CQGAPASGPP SRRVRLVFLG VILVVAVAGN 60 

ittttuu******************************************************** 
tttt lift**** Til 2 *••«****» 

61 TTVLCRLCGG GGPWAGPKRR KMDFLLVQLA LADLYACGGT ALSQLAHELL GEPRAATGDL 120 

******MM*********M***M*mM******M**m******************** 

§ ***** TM3 t»«****tt* ******* TM4 fttt 

121 ACRFLQLLQA SGRGASAHLV VUALERRRA VRLPHGRPLP ARALAALGWL LALLLALPPA 180 

* ' ; ■■ 

************** 

l«# 6 ********* TM5 

181 FVVRGDSPSP LPPPPPPTSL QPGAPPAARA WPGQRRCHGI FAPLPRWHLQ VYAFYEAVAG 240 

***************************************************************** 

4<*4**t«**** ***** 

241 FVAPVTVLGV ACGHLLSVWI RHRPQAPAAA APKSASPGRA PAPSALPRAK VQSLKMSLLL 300 

********************************************************* 
t TM6 ********* . ******* TU7 ********** 

301 ALLFVGCELP YFAARLAAA* SSGPAGDWEG EGLSAALRVV AMANSALNPF VYLFFQAGDC 360 

361 RLRRQLRKRL GSLCCAPQGG AEDEEGPRGH QALYRQRKPH PHYHHARREP LOEGGLRPPP 420 
421 PRPRPLPCSC ESAF 
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HSH2RJ 

D49783 

M32701 

U25440 

S57S6S 

U74716 
U64032 
L41147 
GPRv47 
D43633 



HSH2R_1 

D49783 

U32701 

U25440 

SS7565 

S73473 

1174716 

U64032 

L41I47 

CPRV47 

043633 



HSH2R_1 

D49783 

1132701 

U25440 

S5756S 

S73473 

11747 16 

U64032 

L41I47 

GPRv47 

D43633 



HSH2RJ 

D49783 

113 2701 

U2 5440 

S57565 

S73473 

1174716 

U64032 

L41147 

GPRv47 

D43633 



HTFRDLLSVTFEGPRPD I SAGGSGAGGGAGAGAGAGDTASSESPAVGGVPGAAGGGGGGS 



-uap— 



•HG- 
-NG- 
-NC- 
-HG~ 



-MAF- 
-KEP— — NG 



-TASSFCLDSTACK I T- 
-TASSFCLDSTACKIT- 
-TGSSFCLDSPPCRIT- 
-TVPSFCiDFTVYKVT- 
-TVH5CCLDSMALK Vi- 



lli 

—17V 
—I TV 
--VSV 
-ISV 
—15V 



IIAPIPHXNCS LAF1SDAPTLDPSAANTSGLPG-VPI-AAAL 

MAPIPWCNCS LAFISDAPTLDPSAAWTSCLPG-VPI-AAAL 

VVGAGSCEDNRSSAGEPCGAGGGGE VMGTAAVCGLVVSAQSVG VCV 



-yYP EPGPT ANSTPAIGAGPPSAPGGSG- 



-■VAA 



-HESSPI PQS5GNSS71GRVPQTPGPSTASGYPEVGLRDV-ASE5VALF 



-iillADKTSpy I TSDHS I SNFSTGLFGPHPTVPPDVGYVTSSQSQIIKOLFGLF 



III Till IIIIMM lllltlt TH2 #11111 

VLAVl I L I 7 VAGNVVVCLAVCLNRRLRNLYNCF I VSLA I TDLLLGLL VLPFSA I YQLSCK 
VLAVL 1 L 1 TVAGNYVVCLA VCLNRRLRNL7MCF I VSU I TOLLLGLLVLPFSA I YQLSCK 
VLTVL I L I T I AGNVVVCL AVGLNRRLRSL7HCF I VSLA I TDLLLGLLVLPFSAFYQLSCR 
I LI I L I LVTVAGHVVVCLAVGLHRRLRSL7MCF I VSLAVTDLLLCLLVLPFSA I YQLSCK 
VLT7L 1LIT IAGKVVVCLAVSLNRRLRSL7MCF I VSLAA70LLLGLLVLPFSA I YQLSF7 
AGALLALATVGGNLLV I TA I ARTPRLQT 1 7NVFYTSLATADLVVGLLVypPGATLALTQi 
AGALLALAT VGCHLLV I TAJ AR7PRLQT ITNVFVTSLATAOLYVCl L VMPPCATLALTGH 
FLAAF I LTAVACNILV I LSVACNRHLQTVTH YF I VNLAVADLLLSATVLPFSA7MEVLGF 
ALCVV I ALTAAAKSLL I AL I CTQPALRNTSNFFLYSLFTSDLMVCLV YMPPAMLNALYGR 
FMLLLOLTAVAGNAAVMAY I AK7PALRKFYFVFHLCLVDLLAAL71HPLAIILSSSALFDH 
CMVTLNL IALLAN7GVHVA I ARAPHLKKFAFVCHLCAYDVLCA I LLIIPLG 1 1 SSSPFFCT 

III §111111111 T»3 IIMIIII MMf TU4 

ISFGKVFCN»Y7SLDVIILC7ASILHLFyiSLDRYCAVIIDPLRYPVLVTPARVAISLVLII 
VSFGKVFCN I YTSL0VULC7AS I LNLFM I SLDRYCAVUDPLRYPVLVTPVRVA I SLVL 1 1 
1SFGXVFCN I Y TSL0VHLC7AS I LNLFi I SL0RYCAV7DPLR YP VL I TPVRVAVSLVL 1 1 
WSFSKVFCW I riSLOVIILCYAS I LHLFM I SLDR YCAVTDPLR YP VL J TPARVA ISLVFII 
1SFGHVFCN I YTSLDVHLC7AS I LHLFM I SLDR YCAVTDPLRYP VL V7P VRVA I SLVF I i 
tPLGATGCELITSVDVLCVTAS I ETLCALAVDRYLAVTNPLRYG7LVTKRRARAAVVLVI 
IPLGATGCELITSVDVLCVTASIETLCALAVDRYLAVTNPLRYGTLV7KRRARAAVVLVI 
WAFGRAFCD V8 AAVDVLCC7A5 I LSLCT I S VDR YVGVRHSLK YPA I MTERKAAA I LALL1 
I VLARGLCLLITAFDVUCCSAS ) LNLCL I SLDRYLL 1 LSPLR YKLRM7PLRALALVLGAI 
ALFGEVACRLYLFlSVCFVSLAiLSVSAINyERYYYVVHPIfRYEVRM7LGLVASVL¥CVI 
V VF7 1 LECQVY I FLKVFL I iLS I LT I TA I SVERYFY I VHP8RYEVKIIT I WLV I GVULL 1 1 
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HSH2RJ 

D49783 

M32701 

U25440 

S57SS5 

S73473 

U74716 

U64032 

L41147 

GPRv47 

043633 



HSH2RJ 

049783 

V32701 

U25440 

S57565 

S73473 

M74716 

U64032 

L41147 

GPRv47 

D43633 



HSH2RJ 

D497B3 

H32701 

U25440 

557565 

S73473 

M74716 

U64032 

L4H47 

GPRv47 

D43633 



HSH2JM 

049783 

M32701 

U25440 

S57565 

S73473 

U74716 

U64032 

L4I 147 

GPRv47 

D43633 



JtfSilttfffl % llllltl TU5 11111111 

VISITLSFLSIHLCIN — S RNETSK GNHTT SKCN YQVNEV YGLVDGLVTF YLPLL IHC I 

V I S I TLSFLS I HLGVN — SRNETSKCNHTTSKCK YQVNEVYCLVOCLVTFYLPLLI HC I 

V I S I TLSFLS I HLGWN — 5RNETSSFNHT IPKCK VQVNLVYGLVDGLVTFYLPLLVMC I 

V I S I TLSFLS I HLGWN — SRNETSKDNOT I VKCK VQVNEVYGLVOGLVTF YLPLL I HC I 

VISITLSFLSIHLGWN — SRNGTRGGN-DTFKCK YQVNEVYGLVDGLVTFYLPLLI MCV 

I VSATVSFAP I NSQIIRVGADAEAQECHSN PRCCS FASNNPYALLSSSVSF YLPLLVMLF 

IVSATYSFAPIHSQWIRVGADAEAIJECHSNPRCCS FASNHPYALLSSSYSFYLPLLVMLF 

AVALVVS-KGPLLGfK EPVPPD— ERFC G I TEEVGYAVFSSLCSFYLPHAV I VV 

SLAALASFLPLLLCIH ELCHARPPVPGOC RLLASLPFVLVASGLTFFLPSGAICF 

VKALAMASVPVLCRVS-WEEGAPSVPPG CSL015HSAYCQLFVVVFAYLYFLLPLLL f LV 

FKSLLLA-LVTLFCWPPYGHQSSIAASH CSLHASHSRLRGVFAVIFCVICFLAPYVYIFS 

t : :. • t : 



TYYRIFRVARDQAKRID-HIS— 
TYYRIFKVARDQAKRI N-HI 
TYYRIFK IARD9AKRIH-HMG— 
TYFR I FK 1 AREQARR I N-H I G— 
TY Y 8 1 FK I AREQAKR I N-H I S— 



SHKAATIR- 

— - — SfKAAT IR- 



-SWKAATIG- 
-SIKAATIR- 
-SWKAAT I R- 



-DCVPSCGR 
-DGVPSCGR 



VYARVFVVAKRQRRLLRRELCRFPPEESPRSPSRSPSPATVGTPTAS- 
YYARVFVVAKRQRRFVRRELCRFPPEESPRSPSRSPSPATVGTPTAS- 
BYCRVYYYARSTTRSLEAGVKRERGKASEVVLR IHCRGAASGADGAPGTRGAKGHTFRSS 

TYCR I LLAARKQAVQVASLTTG — HASQASETLQVPRTP — R — PGVESAOS 

VYCSWFRVARVAAMQHGPLPTWIIETP— -RQRSESLSSR— S— — TUVTSSGA 

VY SAVYK VARSAALQQVPAVPTBADAS PAKDRSDS I HSQTT 1 1 TTRTLP 

* : • - ,-• : t r/.-. : v* : ■ 

tflfjffff TH6 ttJftif lllllffllfl 

EHRATVTLAAVVGAF 1 1 C1F P YFT AF V YRGLRGODA INEMLEA I YL1LGY 

EHKATVTLAAVNCAFI ICiFPYFTAFVYRGLRGDDAINEVLEAl VllLGY 

EHKATVTLAAVMCAFI I CKFPYFTVFVYRGLKCODA INEAFEAYYLWLGY 

EHKATVTLAAVVGAF 1 1 CIFPYFTVFVYRGLKGDDAVNEVFEDVVLWLGY 

EHKATVTLAAVHGAF I I C1FPYFTAFVYRGLRGDDA INEAYEG I YLVLGY 

RPARLLPLG-EHRALRTLCL I UG I FSLCiLPFFLANV LRALVCPSL VPSGVF I ALNWLGY 
RPARLLPLC-EHRALRTLGL I MGI FSLCKLPFFLANVLRALVGPSLVPSGVF I ALN1LGY 
LSVRLLKFSREKKAAKTLAIVVCVFVLC1FPFFFVLPLCSLFPQLKPSECVFKVIFWLCV 
RRLAT KHSRKALKASLTLCI LLGMFFVT1LPFFVAN I VQAVC — DC ISPGLFOVLTILGY 
PQTTPHRTFGGGKAAVVLLAVGGQFLLCiLPYFSFHL YVALSAQP I STGQVESYVTI I GY 
QR LSPER AF5GGKAALTLAF I VGQFLVCILPFF I FHLQM SLTGS WKSPGDLEEAVNILA Y 

. : t:.t 

TH7 ttllltlfi 

ANSALNP I LYAALNRDFRTCYQQLFCCRLANRNSHKTSLRSNASQLSRTQSREP8 Q 

ANSALNP I LYAALNROFRTGYQQLFCCRLANRNSHKTSLRSNASQLSRTQSREPR Q 

ANSALNP I LYATLNRDFRTAYQQLFRCRPASHNAQETSLRSNSSQLARNQSREPM R 

ANSALNP I LYAALNRDFRTAYHQLFCCRLASHNSHETSLRLNNSQLNRSQCQEPR ■ 

ANSALNP I LYAALNRDFRTAYQQLFHCKFASHNSHKTSLRLNNSLLPRSQSREGR V 

ANSAFNPL I YCR-SPDFRDAFRRLL-CSYGGRGPEEP — RVVTFPASPVASR 

AN5AFNPLIYCR-SPDFRDAFRRLL-CSYGGRGPEEP — RVVTFPASPVASR 

FN5CVNPL I YPCSSREFKRAFLRLLRCQCRRRRRRRPLVRVYGHHVRASAGGGPHPDCAL 

CNSTUNP 1 I YPLFMRDFKRALGRFLPCPRCPRERQAS-LASPSLRTSHSGPRPGLS L 

FCFTSNPFFYCCLNRQIRCELSKQFVCFFKPAPEEELRLPSREGSIEENFLQ F 

5SFAVNPSF YGLLNRQ I RDELYKFRRCCVTQPVE I GP — SSLEGSFQENFLQ F 
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Figure 25 



HSH2R_1 

049783 

1132701 

U25440 

S57565 

S73473 

H747I6 

U64032 

L41147 

GPRv47 

043633 



QEEKPLKLQVBSGTEVT — 

QEEXPLKLQViSGYEVTAPQGATDR- 
QEEKPLKLQYiSGTEVTAPRGATDR- 
QEDKPLNLQVWSGTEVTAPQGATNR- 
QEEXPLKLQYISGTEtTHPQCNP I R- 
QNS-PLNR — FDGYEGERP-FPT — 
QNS-PLNR — FDGYECERP-FPT- 



$AGAALPGAALALTAAPAPSSAAAPEGQAAGACRRKPPCAFREWRLLGPLRRP7TQLRAK 
QQVLPLPLPPDSDSDSOACSCGSSGLRLTAQLLLPGEATQDPPLPTRAAAAVNFFNIDPA 
LQGTGCPSESiVSRPLPSPKQEPPAVDFRIPGQIAEETSEFLEQQLTSDI IliSDSYLRPA 
IQRTSSSSETHPSFANSNP-RNMENQAHKIPGQIPEEQA 



HSH2RJ 

049783 

U327D1 

U2S440 

S57565 

S73473 

M74716 

U64032 

L41147 

GPRv47 

D43633 



VSSLSHK I RAGGAQRAEAACALRSEVEAVALSVARDVAEDNTCQAYELADYRNIRETDI 

EPELRPHPLG I PTN ■ 

ASPRLES r- 
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BEST AVAILABLE COPY 
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igure 27 



GPRv51 
M35297 



GPRvSI 
H35297 



CPRvSI 
M35297 



GPRvSI 
U35297 



CPRv51 
M35297 



GPRvSI 
1135297 



fttlttltlf TM1 I 

-MNQTLNSSGTVESALNTSKGS'I WHlAf LVLSSLAfciH I CLLGIIA 



MAGNCSIEAHSTNQNKIICPGUSEALELYSRGFLT I EQ I ATLPPPAVTNY I FLLLCLCCLV 
**. .. * *: **** . . t. : : : ****:. 

tStttt ttttUit T1I2 Ittttttit 

GNSUV I 1LLGFRUHRNPFC I Y ! LNLAAADLLFLFSMASTLSLETQPLVN-TTDKVHELMK 
GNGLVLIFFGFS I KRTPFS I YFLHLASADG I YLFSKAV I ALLNHGTFLGSFPDYVRRVSR 
**.:*:#::** ::*.«.**:*:**:** ::*<** *: .::. .**:.: : 

Mfttttf TH3 tttltttt lllltMl TII4 IttfffM 

RLMYFAYTVGLSLLTA I STQRCLSVLFP I VFKCHRPRHLSAIVCGLLfTLCLLMNGL TSS 
I VGLCTFFAGVS L L PA I S I ERCVSV 1 FPHWYf RRRPKRLSAGVCALLWLLSFLVTS 1 HNY 
.•:«•.«* ■:«:*♦:«:'*: :**::«* . 

t Iftltfltt TH5 tlltltll 

FCSKFLKFNE-DRCFR VDMVQAAL I HGVLTPVHTLSSLTLFVIVRRSSQQIIRRQPTRLFV 
FCMFLGHEASGTACLNMDI SLGI LLFFLFCPLIIVLPCLALI LHVECRARRRQRS-AKLNH 
** : . . ::*:*.*..*:*_:: *. ::: :*. ::* 

tttittU TH6 tftlllfti tltlltllt TII7 ftltltl 

VVLASVLVFL I CSLPLS I YWFV LYILSLPPEBQVLCFSLSRLSSSVSSSANPV I YFLVGS 

VVLAIVSVFLVSSIYLGIDfFLFIVFQIP APFPEYVTDLCI CI NSSAKP I VYFLAGR 

**** t t.* **::: :.:« :: *. 



rrshrlptrslgtvlqqalre — epeleggetptvgtnevga 

dksqrliep-lrVvfqrairdgAepcdaasstpntvtheuqc 



:*:«* 
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Figure 28 




Amino acid residues 



BEST AVAILAPi .F — ~ 
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e 29 



(lllflllt Tit tt 

Y 14705 -yTSAESLLFTSLGPSPSSCOGDCRFNEEFKFILLPMSVAVVFVLGLAL 

AJ277752 IftSADSLLFTSLGPSPSSGDGDCKFNEEFKFILLPLSTAVVFVLGLAL 

AFQ3U97 — HDAPVRMFSLAPWTPTPTPfLCGNTTAAAEAKCVFNEEFKFILLPISYCIVFVVGLPL 
X99953 MTEDIMA7SYP7FLT7PYLPUKLLMNLTND7EDICVFDEGFKFLLLPVSYSAVFIIVGI.pl 
AF06 9555 ~ HSMANF7A — GR — ' — NSC7FQEEFKQVLLPLVYSVVFLLGLPL 



XaD lb j ——>—'—— ——»3ii Aiir 7 u Gn _ *~^ i rnctr r\nVcur lVT3 ■ if kLuCrL 

D63665 VERDNGTIQAPGLPP TTCVYREDFKRLLLPPVYSVYLYYGLPL 

CPRv7 , IfEK VDHNTSQ— EQ- CLCQFSEX YKQVYLSLAYS 1 1 F I LCLPL 

t : t :t : *. 

MSI* Ifllllli 7U2 flltlftl Itii 

Y 1 4705 KAPTLlLFLFfltRPiOATATYIIFHLALSDTLY VLSLPTLVY?YAARNH»PFCTCLCKFVR 

AJ277752 NAPTLILFLFRLRPIOATATYHFHLALSDTLYVLSLPTLVYYYAARNHIPFGTCFCKFVR 
AF03 1897 NSWAUV I FVSRMRPINATTTYMFNLA 1 5DTLYVFSLPTLYY YYAORNN*PFGKVFCK I VR 

X99953 WJAA«WiFIAK«RP«NPT7VYMFNLALSDTLYVLSLPTLVYYYADKNNBPFGEVLCfiCLVR 
AF06955S NAVV I GQ I ■LARKALTRT7 1 Y1ILNLATADLLYVCSLPLL i YNYTQKDYlfPFGDFTCKFVR 

X98283 NAVV I GQ I ILARKAL7R7TI YMLNLAMADLLYVCSLPLL 1 YNYTQKDYIPFGDFTCKFVR 

D63B65 NVCV I AQ I CASRRTLTRS AVYTLNLALADLLYACSLPLL I YNYARGDHffPFGOLACRLVR 

GPRv7) NGTVLWHFIGQTKRHSCATTYLVNUIVADLLYVL-LPFL 1 1 TYSLODRIPFGELLCKLVH 

* ::*.:♦ *# *: *: : *t»t 

8IITU3 IIIIIIMIIH Itfllftl TB4 llfll 

Y1470S FLFYINL YCSVLFLTC I SVHRYLG I CHPLRA1 Rf GRPR-FASLLCLGVILVVAGCLVPNL 

A J 277752 FLFYKNLYCSVLFtTC I SVHRYUG I CHPLRA I RIGRFR-FAGLLCLGYILVVAGCLVPNL 

AF03 ! 837 FLFYANL YSS i LFLTC f S VHR YHG I CHP I RSLKf VKTK-HARL I CVGVf L Wr I CL I PKL 

X99953 FLFYANLYSS I LFLTC J SVHRYRGVCHP I TSLRRKNAK-HAYV J CALVWLSVTLCLVPNL 

AF06 9555 FQFYTNLHGS ! LFLTC I SVQRYUGI CHPLASIHKXKGKKLT1LVCAAVVF I VI AQCLPTF 

X 98 283 FQFYTNLHGS I LFLTC I S VQR YMG I CHPLASIHKKKGKKL71L VCAA YIF I VI AQCLPTF 

D63S65 FLFYANLHGS I LFLTC I SFQRYLGI CHPLAPIHKRGGRRAAIVVCGVV1LVVTAQCLPTA 

GPRv71 FLFY I NLYCS I LLLTC I SVHQFLGVCHPLCSLPYRTRR-HA*LGTSTT*ALVVLQLLPTL 

It Ifftilltl TMS II88III 

Y 1 47 05 FFVTTNANGTT I LCHDTTIPEEFDHYVYFSSAVMVLLFCLPFL I TLVCYCLHARRLYRPL 

A J 27 7752 FFVTTNANGTT I LCHDTTLPEEFDHYVYFSST I MVLLFGFPFL I TLVCYCLHARRLYRPL 

AF031897 I FVTTSSKDNSTLCHDTTKPEEFDHYVHYSSS I MALLFGt PFLV I YVCYCLtf AKRLCKRS 

X 99953 I FVTVSPK YKNT I CHDTTRPEDFARY VEYSTA I HCLLFG I PCL 1 1 AGCYGUTRELUKP t 

AF05 955 5 VFASTGTQRN RTVCYOLSPPORSASYFPYG I TLT I TGFLLPFAA I LACYCSHAR I LCQKD 

X 982 83 VFASTCTQRNRTVCYDLSPPDRSTSYFPYC I TLT I TGFLLPFAA I LACYCSHAR I LCQKD 

D63665 VFAATG MJRNRTYCYDLSPP I LSTR YLPYGMALT V I GFLLPFTALLACYCRUARRLCRQO 

GPRv71 AFSHTDY I NGQM 1 1YDUTSQENFDRLFAYG I VLTLSGFLSLLGHFGVLFTDGQEPDQARG 

± ' m t • ft 

uttntt tub muun $$ 

Y 1 4705 PGAGQS SSRLRSLRT I AVVLTVFAVCFVPFH I TRT I YYQAR-LLQADCHYLN I VNVV 

A J 277752 PGAGQS SSRLRSLRT I AVVLTVFAVCFVPFH! TRT I YYLAR-LLNAECRVLK I VKVV 

AF031897 ^ FPSPSPRVPSYKKRSIKUI 1 1 VLT VFA I CFVPFH / TRTL Y f TSR-YFQACCQTLN I INFT 

X99953 VSGNQQTLPS YKKRS I KT I IFVHIAFAI CFMPFH I TRTLY VYAR-LLCI KCYALNY I HVT 

AF0S9555 EL I GLAVH-KKKDK AVRH 1 1 IVVIYFSI SFFPFHLTKT I YL I VRSSPTLPCPTLQAFA I A 

X 9828 3 ELI GLA VH-KKKDXA VRM II I V V I YFS I SFFPFHLTKT I YL I VRSSASLPCPTLQAFA I A 

D63665 GPAGPVAQ-ERRSKAARKAVVVAAVFV1SFLPFHITKTAYLAVRSTPGVSCPVLETFAAA 

GPRvTI EPHEDRQHSPSOVHPDHPTGVWPLHPLFCALPYHSLLLPHHLLSAFSGLPALDGSQCGLQ 
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igure 30 



Y1470S 

AJ277752 

AF031897 

X99953 

AF069555 

X98283 . 

D63665 

GPRv71 



fttf< TH7 IttlltSIt 

YKV7RPLASANSCLDPVLYLF7GDKYRNQLQQLCRGSK--PKPR TAASSL 

YKVTRPLASANSCLDPVLYLFTGDKYRNQLQQLCRGST — PKRR -TTASSL 

YK I TRPLAS I NSCLDP I LYFHAGDKYRGRLRRGAAQR P-R PVPTSL 

YKVTRPLASANSC I DP I LYFLANDRYRRRL I RTVRRRSSVPNRRCMHTNHPQTEPHIITAG 

YKCTRPFASMNSVLDPILFYFTQRKFRESTRYLIDKIIS-- — - — SKIRHD 

YKCTRPFASINSYLDPILFYFTQRKFRESTRYLLDKMS SKiRQD 

— AKIQRQ 
— EHPAGRK 



YKGTRPFASANSVLDPILFYFTQQKFRRQPHDLLQKLT— 
DHEASGECEQLPQPSPVLSFKGGICNRVRLLQKLRQNKLG- 



.»:t 



Y14705 

AJ277752 

AF031897 

X99953 

AF069S55 

X98283 

D63665 

GPRV71 



ALVTLHEES ISRIADTHQDSTFSAYEGDRL 

ALVTLHEES I SRWAD I HQDS I FPAYEGDRL — 

LALVSP SVDSSV VGSCCNSE : S RGIIGT VKS RGGQ : 

PLPV I SAEE I PSHGSWVRDENGEGSREHRVEWTDTKE I MQWUNRRST I KRNSTDKKDNKE 

HCJTYGS ~ 

HC I SYGS — ; : " 

RV : ; 

RCPGLNRSG ~ ' : 



Y1470S 

AJ277752 

AF031897 

X 99953 

AF069555 

X98283 

D63665 

GPRV71 



NRHGENYLPYVEVVEXEDYETKRENRKTTEQSSKTNAEQDELQTQtOSRLKRGKWQLSSK 



Y 14705 

AJ277752 

AF031897 

X99953 

AF069555 

X98283 

D63665 

GPRv71 



KGAAQEHEKGHHEPSFEGEGTSTIfNLLTPKyYGKKORLAKNVEEVGYGKEKELQNFPKA 
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Figure 31 

i 




' i. L. -1- — i- —I- ! ! ! ! >- 

1 51 101 151 201 251 301 351 401 451 501 

Amino acid residues 



BEST AVAILABLE OOpv 
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igure 32 



" mtitm Tyi imttii 

U03866 MVFLSCNAS — OSSNCTQPPAP — — VN I SKA I LLGV I LCGLI LFGVLGN I LV 

U]1U MVFLSGNAS — DSSNCTQPPAP-- — VN I SKA I LLGV I LGGLI LFGVLGN I LV 

02 523 S UVFLSGNAS — OSSNCTQPPAP VN I SKA I LLGV I LGGL I LFGVLCN I LV 

032202 UVFLSGNAS — OSSNCTQPPAP VN i SKA I LLGV I LGGL I LFGVLGN I LV 

032201 UVFLSGNAS— OSSNCTQPPAP-- — VN I SKA I LLGV I LGGL I LFGVLCN I LV 

AF013261 -MVFLSGNAS-DSSNCTQPPAP VN I SKA I LLGV I LGGL I LFGVLGN I LV 

U81982 --MVFLS GNAS--DSSNCTHPPAP- VN I SKA I LLGV I LGGL I LFGVLGN I LV . 

u07126 --IIVLLSENAS — EGSNCTHPPAP— — VN I SKA I LLGV I LGGL I IFGVLGNILV 

S7 1 32 3 MVPVLDNMTPSSVTL — NCSNCSHVLAPE LNTVKAVVLGN VLG I F I LFGV I CN I LV 

06 3859 — -HTPSSVTL — NCSNCSHVLAPE — --LNTVKAVVLGHVLGIF I LFGV I GNILV 

AF091890 — MSLNSSLSCRKELSNLTEEECC EGGVI ITQFI AI IVlT IFVCLGNLVI 

CPRv7Z MTSTCTNST--RESNSSHTCMPLSKMPI SLAHGI IRSTVLVIFLAASFVGNIVL 

. : . : • 

t|» iftltlfffl TU2 IKIIIItttl ittflttft 

U03866 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN IIAAVDV 

L3 1 774 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN I WAAVDV 

D25235 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN I IAAVDV 

032202 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN If AAVDV 
032201 | LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN I IAAVDV 
AF013Z61 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FEVLGYIAFGRVFCN I IAAVDV 
U81982 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA 1 FE 1 LGYIAFGRVFCN I IAAVDV 
U07126 I LSVACHRHLHSVTHYY I VNLAVADLLLTSTVLPFSA I FE I LGYIAFGRVFCN I IAAVDV 
S7 1 323 I LSVVCHRHLQTVTYYF I VNLAVADLLLSSTVLPFSA I FE I LDRIVFGRVFCN I IAAVDV 
D63859 I LSVVCHRHLQTVTY YF I VNLAVADLLLSSTVLPFSA I FE I LDRIVFGRVFCN I IAAVDV 
AF0918SO VVTLYKKSYLLTLSNKFVFSLTLSNFLLSVLVLPFVVTSS I RREIIFGVVICNFSALLYL 
GPRV72 ALVLQRKPQLLQVTNRF IFNLLVTDLLQ ISLVAPIVVATSVPLFIPLNSHFCTALVSLTH 

: : : * v: *-:. :*. - : 

f TM3 Uftf ttl* SttmtfSf TM4 #11 !•#«# 

U0386 6 LCCTAS I UGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRCLMALLCVIALSLV I S I GPLFGIR- 

L31774 LCCTASIMGLCI I S I DRY I GVSYPLRYPT I VTQRRGLMALLCVIALSLV IS I GPLFGIR- 

D2 52 3 5 LCCTAS I UGLC I I S I DRY I GVSYPLRYPT I VTQRRGLNALLCVIALSLV I S I GPLFGIR- 

03 2202 LCCTAS I MGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVIALSLV I S I GPLFGIR- 

D32201 LCCTAS I MGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVIALSLV IS I GPLFGIR- 

AF01 326 1 LCCTAS I MGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGLMALLCVIALSLV I S I GPLFGIR- 

U81982 LCCTAS 1 1 SLCV 1 S I DRY I GVSYPLRYPT I VTQRRGLRALLCVIAFSLV I SVGPLFGIR- 

U07 1 26 LCCTAS I MGLC 1 1 S I DRY I GVSYPLRYPT I VTQRRGVRALLCVIVLSLV I S I GPLFGIR- 

S7 1 3 23 LCCTAS I MSLCV (SVDRY I GVSYPLRYP AIMTKRRALLAVMLLIVLSV I IS! GPLFGIK- 

D6 38 5 9 LCCTAS I MSLCV I SVDRY I GVS YPLRYPA I MTKRRALLAVMLLIVLS V 1 1 S I GPLFGIK- . 

AF09 1 890 LI SSASHLTLGV I A I DR Y YAVLYPM VYPMK I TGNRAVMALVY I ILHSL I GCLPPLFGISS 

GPRv72 ^ LFAFASVNT I VVVSVDR YLS 1 1 HPLS YPSKNTQRRGYLLLYGTI I VA I LQSTPPLYGiGQ 
* . »*: :::::»**.::*:** :* * ::: . **:*» 
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Figure 33 



U03B66 

D25235 

D32202 

D32201 

AF013261 

U81982 

U07126 

S71323 

D63859 

AF091890 

CPRv72 



U03866 

L31774 

D25235 

D32202 

032201 

AF013261 

U81982 

U07126 

S71323 

D63859 

AF091890 

GPRV72 



U03866 

L31774 

D2S23S 

D32202 

D32201 

AFO 13261 

U81982 

U07126 

S71323 

D638S9 

AF091890 

GPRv72 



JJ # IIIM«NI TM5 tttttitttt 

QPAPEDET I CQ I NE — EPGYVLFSALGSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

yrAPEDET i CQ i tic — crGYYLrSALGSFTLFLA i i LYaiVCRY YVYAKRESnGLK 3 

QPAPEDET I CQ l ME — EPGYVLFSALGSFYLPLA I I LVMYCRVYVVAKRESRGLK — --S 

QPAPEDET I CQ I NE — EPGYVLFSALGSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

QPAPEDET I CQ I NE— EPGYVLFSALGSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

QPAPEDET I CQ I NE— EPGYVLFSALGSFYLPLA 1 1 LVMYCRVYVVAKRESRGLK S 

QPAPDDET I CQ INE— EPGYYLFSALGSFYVPLT 1 1 LAMYCRVYVVAKRESRGLK--— S 

QPAPEDET I CQ I ME — EPGYVLFSALGSFYViPLA 1 1 LVMYCRVYVVAKRESRGLK S 

EP APEDETVCK I TE — EPGYA I FSAVGSFYLPLA 1 1 LAM YCRVY VVAQKESRGLK E 

EPAPEDETVCK I TE— EPGYA I FSAVGSFYLPLA 1 1 LAMYCRVY VVAQKESRGLK— E 
VEFDEFKWMCVAAiHREPGYTAFWQilCALFPFLVMLVCYGFIFRVARVKARKVH-— C 
AAFDERNALCSM I WGASPSYT I LS VVSF I V I PL I VM I ACYSVVFCAARRQHALLYNVKRH 
• • - ± *....**.. 

a a • ^ ■ ~ • ~ « « ■ • ~ • • • • • ^ • • • ~ a i • 

tttttttt 

GLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREICKAAICTLGI VVG- 
GLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGIVYG- 
GLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGIVVG- 
GLKTDKSDSEQVTLRIHRKNAPAGGSGHASAKTKTHFSVRLLKFSREKKAAKTLGI VVG- 
GLKTDKSDSEQVTLRIHRKNAPAGGSGUASAKTKTHFSVRLLKFSREKKAAKTLGI VVG- 
GLKTDKSDSEQVTLRIHRKNAPAGGSGMASAKTKTHFSVRLLKFSREKKAAKTLGIVVG- 
GLKTDKSDSEQVTLRIHRKNAPAGGSGVASAKNKTHFSVRLLKFSREKKAAKTLGI VVG- 
GLKTDKSDSEQVTLRIHRKNVPAEGGGVSSAKNKTHFSVRLLKFSREKKAAKTLGI VVG- 
GQK I EKSDSEQV I LRMHRGNTTVSED-^EALRSRTHFALRLLKFSREKKAAKTLG I VYG- 
GQK I EKSDSEQV I LRMHRGNTTVSED — EALRSRTHFALRLLKFSREKKAAKTLG I VVG- 

GTVV I VEEDAQRTG RKNSSTSTS — SSGSRRNAFQGWYSANQCK-ALiTl LVVLG- 

SLEVRVKDCVENEDEEGAEKKEEFQD — ESEFRRQHEGEVKAKEGRMEAKDGSLKAKEGS 

a m. a • • »•« • ■ ' 

tlttntl TM6 ttttttittttttit fttttitttttt TU7 

CFVLCWLP FFLVMP I GSFFPD-— FKPSETVFK I VFf LGYLNSC I N 

CFVLCWLP FFLVMP I GSFFPD FKPSETVFK I VFWLGYLNSC J N 

CFVLCWLP FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC I N- — 

CFVLCWLP -FFLVMP I GSFFPD FKPSETVFK I VFWLGYLNSC I N 

CFVLCWLP — ' — FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC I N 

CFVLCWLP FFLVMP I GSFFPD FKPSETVFK I VFILGYLNSC I N 

CFVLCWLP- FFLVMP I GSFFPD — FKPPETVFK I VFWLCYLNSC 1 H 

CFVLCWLP FFLVMP I GSFFPD FKPSETVFK I VFWLGYLNSC I N 

CFVLCWLP FFLVLP I GS I FPA YRPSDTVFK I TFWLGYFNSC I N 

CFVLCWLP -FFLVLP I GS I FPA YRPSDTVFK I TFWLGYFNSC i N 

AFMVTWGP YUVV I ASEALWGK SSVSPSLETVATWLSFASAVCH 

TGTSESSVEARCSEEVRESSTVASDGSMEGKEGSTXVEENSMKADKGRTEVNQCS IDLGE 

♦ . 

• * » -a* a 
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Figure 34 



U03866 

L31774 

D25235 

D32202 

D32201 

AF0132S1 

U81S82 

U07126 

S71323 

D63859 

AF091890 

CPRv72 



U038E6 

L31774 

D25235 

D32202 

032201 

AF0132S1 

U81982 

U07126 

S71323 

063859 

AF091890 

GPRv72 



U038S6 

L31774 

D2S23S 

032202 

D32201 

AF013261 

U81982 

U07126 

S71323 

D638S9 

AF09I890 

GPRv72 

U03866 

131774 

D2S235 

032202 

032201 

AF013261 

U81982 

U07126 

S71323 

063859 

AF091890 

GPRv72 



If tf 8til 

-PI I YPCSSQEFK KAFQNVLR I QCLCRKQSSKH — ALCYT-LHPPSQAVECQHK- 

-Pl I YPCSSQEFK KAFQNVLR I QCLRRKQSSKH — ALGYT-LHPPSQAVEGQHK- 

-P 1 1 YPCSSQEFK KAFQNVLR I QCLRRKQSSKH — ALGYT-LHPPSQAVEGQHK- 

-P 1 1 YPCSSQEFK KAFQNVLR I QCLRRKQSSKH — ALGYT-LHPPSQAVEGQHK- 

-Pl I YPCSSQEFK-' — KAFQNVLR I QCLRRKQSSKH — ALGYT-LHPPSQAVEGQHK- 

-Pl I YPCSSQEFK KAFQNVLR I QCLCRKQSSKH — ALGYT-LHPPSQAVECQHK- 

-P 1 1 YPCSSQEFK KAFQNVLK I QCLRRKQSSKH — ALGYT-LHAPSQALEGQHK- 

-P 1 1 YPCSSQEFK KAFQNVLR I QCLRRRQSSKH — ALGVT-LHPPSQALEGQHR- 

-P 1 1 YLCSNQEFK KAFQSLLGVHCLRMTPRAHHHHLSVGQSQTQGHSLT i SLDSKG 

-P 1 1 YLCSNQEFK K AFQS LLG VHCLRMT PR AHHHHLS VGQ SQTQGH SLT I SLDSKG 

-PL I YGLWN KTVRKELLGMCFGDRYYREP — — FVQR — QRTSRLFSISHR- 

DOUEFGEDD I NFSEDDVEAVN I PESLPPSRRNSNSNP: — PLPRCYQCKAAKV I F 1 1 IFS 



DMVR I PVGSRETFYR I SKTDG— VCEIKFFSSMPRGSAR I TVSKDQS— SCTTARVRSKS 
OHVR I PVGSRETFYR I SKTDG-- VCEIKFFSSMPRGSAR I T VSKDQS— SCTTARVRSKS 
DUVR I PVGSRETFYR I SKTDG— VCEIKFFSSMPRGSAR I TVSKDQS — SCTTARVRSKS 
DUVR I PVGSRETFYR I SKTDG — VCEIKFFSSMPRGSAR I TVSKOQS — SCTTARTKSRS 
DUVR I PVGSRETFYR I SKTDG — VCEIKFFSSMPRGSAR I TVSKDQS — SCTTARGHTPH 
OUVR I PVGSRETFYR I SKTDG— VCEIKFFSSMPRGSAR I TVSKDQS — SCTTARRGMOC 
DUVR I PVGSGETFYK I SKTDG — VCEIKFFSSMPRGSAR I TVPKDQS — ACTTARVRSKS 
DUVR I PVGSGETFYK I SKTDG — VCEIKFFSSMPQGSAR I TVPKDQS — ACTTARVRSKS 
APCRLSPSSSVALSRTPSSRO— SREIRVFSGGPINSG— PGPTEAG— RAKVAKLCNKS 
APCRLSPSSSVALSRTPSSRO — SREIRVFSGGPINSG — PGPTEAC — RAKVAKLCNKS 

- 1 TDLGLSPHLTALMAG GQPLGHS — SSTGDTG — FSCSQDSGN — 

Y VLSLGPYCFLAVLAV1VDVETQVPQIV I T 1 1 1 WLFFLQCC I HP YVYGYUHKT I KKE I QD 



FLQVCCCVGPS-TPSLDKN— HQVPT I KVHT I SLSENGEEV '■ 

FLQVCCCVGPS-TPSLOKN — HQVPTIKVHTI SLSENGEEV 

FLEVCCCVGPS-TPSLOKM — HQVPT I KVHT I SLSENGEEV 

VTRLECSG — ~M I LAHCN — LRLPGS RDSPASASQAAGTTGDVPPGR RHQ AQL I FVFLV 



RYFTKNCR- 



-EH I KHVN— FUMPPIRKGLEC- 



FLQVCCCVGPS-TPNPGEN — HQVPT I K I HT I SLSENGEEV 

FLQVCCCVGSS-APRPEEM — HQVPT I K I HT I SLGENGEEV 

LHRTCCCI LRARTPTQDPAPLGDLPT I KIHQLSLSEKGESV 

LHRTCCC1 LRARTPTQOPAPLGOLPT I K I HQLSLSEKGESV 

-LRAL " 

MLKKFFCKEK — PPKEDSH — POLPGTEGGTEGK I VPSYOSATFP- 



ETGFHHVGQDDLDLLTS 



jsnnrin: <sp 124364BA1 i > 
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This international search rcpon has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
Claims Nos.: 16 

because they relate to subject rnaner not required to be searched by this Authority, namely: 

The invention as set forth in claim 16 pertains to methods for diagnosis of 
dieeaaes and thus relates to a subject matter which this International Searching 
Authority is not required, under the provisions of Article 17 (2). (a) (i) of the 
PCT and Rule 3 9.1 (iv) of the Regulations under the PCT, to search. 

| I Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such an 
extent that no meaningful international search can be carried out, specifically: 



3. [Zl Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of Invention is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 

'* The inventions as set forth in claims 1 to 15 and 17 are divided into groups 
of 9 individual inventions, i.e. , inventions relating to DNAs encoding the amino 
acids of SEQ IDNOS:l to 4 and 17 to 21, and DNAs having the sequences of SEQ 
IB NOS: 5 to B and 22 to 26.- These groups of inventions are not considered as 
relating to a group of inventions so linked as to form a single general inventive 
concept. 
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claims. 

2. Q] As all searchable claims could be searched without efibrt justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. [] As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. ^) No required additional search fees were timely paid by the applicant. Consequently, this international 
search report is restricted to the invention first mentioned in Ifac claims; it is covered by claims Nos.: 

Claims 1 to IS and 17 (inventions relating to the DNA encoding the 
amino acid sequence of SEQ ID N0:1 and the DNA having, the sequence 
of SEQ ID NO:S) , 

Remark on Protest Q The additional search fees were accompanied by the applicant's protest. 

n No protest accompanied the payment of additional search fees. 
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